Releases: fsecada01/TextSpitter
Releases · fsecada01/TextSpitter
TextSpitter v1.0.0
Production Release 🚀
TextSpitter v1.0.0 is now available on PyPI!
What's New
Core Features
- 📄 Multi-format extraction: PDF (PyMuPDF + PyPDF fallback), DOCX, TXT, CSV, and 50+ source code file types
- 🔌 Stream-first API: Supports file paths, BytesIO, SpooledTemporaryFile, and raw bytes
- 🛠️ Optional logging: Built-in loguru support with automatic stdlib logging fallback
- 🖥️ CLI tool:
textspittercommand for quick one-off extractions - 📚 Professional documentation: Landing page, quick-start, tutorials, use cases, and recipes
Quality Assurance
- 🧪 ~80 pytest tests with 89%+ code coverage
- ✨ Full type hints with PEP 561 marker
- 🔒 Pre-commit hooks (ruff, black, isort, ty)
- 🚀 Automated CI/CD on Python 3.12–3.14
- 📖 Auto-published docs to GitHub Pages
Installation
pip install textspitter
# With optional loguru logging
pip install "textspitter[logging]"Quick Example
from TextSpitter import TextSpitter
text = TextSpitter(filename="report.pdf")
print(text[:200])Documentation
Key Improvements in v1.0.0
✅ Fixed CI/CD type checking for Python 3.12–3.14
✅ Added system dependencies for lxml build on Python 3.14
✅ Replicated SQLModel-CRUD-Utilities documentation approach
✅ Created professional landing page with feature cards
✅ Complete documentation pages with proper navigation
✅ Fixed code block whitespace formatting
✅ All pre-commit hooks passing
Thanks for using TextSpitter! 🎉
v0.4.0
What's Changed
- Bump lxml from 4.6.4 to 4.6.5 by @dependabot in #6
New Contributors
- @dependabot made their first contribution in #6
Full Changelog: 0.3.6...v0.4.0
0.4.0.b2
Full Changelog: 0.3.7.b2...0.4.0.b2
0.3.7.b2
Full Changelog: 0.3.7.b1...0.3.7.b2
0.3.7b0
What's Changed
- Bump lxml from 4.6.4 to 4.6.5 by @dependabot in #6
New Contributors
- @dependabot made their first contribution in #6
Full Changelog: 0.3.6...0.3.7b0
Patch: `TextSpitter.core`
patching TextSpitter.core to explicitly convert filename contents into File objects