Add PDF text fallback for bill documents #2121

Open
Smoss wants to merge 1 commit into codeforboston:main from Smoss:t3code/4d25cd43

@Smoss Smoss commented May 6, 2026

  • Extract embedded text from bill PDFs when DocumentText is missing
  • Add backfill tooling and documentation for repairing existing bills

Summary

Add basic PDF parsing as a fallback for when DocumentText is unavailable, per #2081.
This still fails on several PDFs that lack embedded text, but it works for a large portion of them.
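
For reviewers, the fallback order described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `resolveBillText`, `ResolvedText`, and the `source` labels are hypothetical names, and the PDF text is assumed to have already been extracted by whatever PDF library the change uses.

```typescript
// Hypothetical shapes for illustration only; the types in the PR may differ.
type TextSource = "DocumentText" | "pdf" | "none"

interface ResolvedText {
  text: string
  source: TextSource
}

// Collapse whitespace runs so a PDF that yields only layout whitespace
// is treated as having no usable text.
function normalizePdfText(raw: string | null | undefined): string {
  return (raw ?? "").replace(/\s+/g, " ").trim()
}

// Prefer DocumentText when it is present and non-blank; otherwise fall back
// to the text extracted from the PDF; report "none" when neither has content
// (for example, a PDF with no embedded text).
function resolveBillText(
  documentText: string | null | undefined,
  pdfText: string | null | undefined
): ResolvedText {
  if (documentText != null && documentText.trim().length > 0) {
    return { text: documentText, source: "DocumentText" }
  }
  const fallback = normalizePdfText(pdfText)
  if (fallback.length > 0) {
    return { text: fallback, source: "pdf" }
  }
  return { text: "", source: "none" }
}
```

Recording which source produced the text (an assumption of this sketch) would let a backfill run skip bills that already have DocumentText and list the ones where the PDF had nothing to extract.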

Checklist

  • On the frontend, I've made my strings translatable.
  • If I've added shared components, I've added a storybook story.
  • I've made pages responsive and look good on mobile.
  • If I've added new Firestore queries, I've added any new required indexes to firestore.indexes.json. (Please do not create indexes only through the Firebase Web UI, even though the error messages may recommend it; indexes created this way may be obliterated by subsequent deploys.)

Screenshots

image

Known issues

If you've run against limitations or caveats, include them here. Include follow-up issues as well.

Steps to test/reproduce

For each feature or bug fix, create a step by step list for how a reviewer can test it out. E.g.:

  1. Go to the home page
  2. Click on a testimony
  3. See that it's loaded with a loading spinner


vercel Bot commented May 6, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Actions | Updated (UTC) |
| --- | --- | --- | --- |
| maple-dev | Ready | Preview, Comment | May 6, 2026 3:25am |
