Skip to content

Releases: fsecada01/TextSpitter

TextSpitter v1.0.0

18 Feb 05:21

Choose a tag to compare

Production Release 🚀

TextSpitter v1.0.0 is now available on PyPI!

What's New

Core Features

  • 📄 Multi-format extraction: PDF (PyMuPDF + PyPDF fallback), DOCX, TXT, CSV, and 50+ source code file types
  • 🔌 Stream-first API: Supports file paths, BytesIO, SpooledTemporaryFile, and raw bytes
  • 🛠️ Optional logging: Built-in loguru support with automatic stdlib logging fallback
  • 🖥️ CLI tool: textspitter command for quick one-off extractions
  • 📚 Professional documentation: Landing page, quick-start, tutorials, use cases, and recipes

Quality Assurance

  • 🧪 ~80 pytest tests with 89%+ code coverage
  • ✨ Full type hints with PEP 561 marker
  • 🔒 Pre-commit hooks (ruff, black, isort, ty)
  • 🚀 Automated CI/CD on Python 3.12–3.14
  • 📖 Auto-published docs to GitHub Pages

Installation

pip install textspitter

# With optional loguru logging
pip install "textspitter[logging]"

Quick Example

from TextSpitter import TextSpitter

text = TextSpitter(filename="report.pdf")
print(text[:200])

Documentation

Key Improvements in v1.0.0

✅ Fixed CI/CD type checking for Python 3.12–3.14
✅ Added system dependencies for lxml build on Python 3.14
✅ Replicated SQLModel-CRUD-Utilities documentation approach
✅ Created professional landing page with feature cards
✅ Complete documentation pages with proper navigation
✅ Fixed code block whitespace formatting
✅ All pre-commit hooks passing

Thanks for using TextSpitter! 🎉

v0.4.0

30 Jun 20:16

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 0.3.6...v0.4.0

0.4.0.b2

11 Jun 03:28
4e78fef

Choose a tag to compare

0.4.0.b2 Pre-release
Pre-release

Full Changelog: 0.3.7.b2...0.4.0.b2

0.3.7.b2

14 Dec 04:16

Choose a tag to compare

0.3.7.b2 Pre-release
Pre-release

Full Changelog: 0.3.7.b1...0.3.7.b2

0.3.7b0

23 Sep 15:30
fb64a60

Choose a tag to compare

0.3.7b0 Pre-release
Pre-release

What's Changed

New Contributors

Full Changelog: 0.3.6...0.3.7b0

Patch: `TextSpitter.core`

26 Nov 16:54
5bb8bd9

Choose a tag to compare

Pre-release

patching TextSpitter.core to explicitly convert filename contents into File objects

0.3.6

10 Nov 14:30

Choose a tag to compare

Fix for extracting string content from text files. Typo fix in README.

0.3.4

26 Sep 20:30

Choose a tag to compare

  • Requirements Updates
  • Moved utility functions into class wrapper