Skip to content

docs: Added starter dev notes on push to hugging face hub#355

Open
nabinchha wants to merge 13 commits intomainfrom
nmulepati/docs/dev-notes-push-to-huggingface-hub
Open

docs: Added starter dev notes on push to hugging face hub#355
nabinchha wants to merge 13 commits intomainfrom
nmulepati/docs/dev-notes-push-to-huggingface-hub

Conversation

@nabinchha
Copy link
Copy Markdown
Contributor

@nabinchha nabinchha commented Feb 26, 2026

Adds a dev note post to cover push_to_hub feature of Data Designer

@nabinchha nabinchha requested a review from a team as a code owner February 26, 2026 18:20
@greptile-apps
Copy link
Copy Markdown
Contributor

greptile-apps bot commented Feb 26, 2026

Greptile Summary

This PR adds a new dev note documenting the push_to_hub feature, covering the full upload pipeline, processor promotion to HF dataset configs, auto-generated dataset cards, auth, and the round-trip reproducibility story. Previously flagged issues (multiple <!-- more --> markers, incomplete template path) are resolved in the current HEAD.

  • P1 — accidental date change in text-to-sql.md: the frontmatter date was changed from 2026-03-11 to 2026-04-14 with no other edits to that file; this would corrupt the MkDocs blog listing sort order and RSS feed publication date for an already-published post.

Confidence Score: 4/5

  • Safe to merge after reverting the accidental date change in text-to-sql.md.
  • One P1 finding: the text-to-sql.md date was changed from 2026-03-11 to 2026-04-14 with no other edits, which would misrepresent the original publication date in the blog index and RSS feed. The new push-to-hub post itself is clean — previously flagged issues are resolved.
  • docs/devnotes/posts/text-to-sql.md — revert the date change before merging.

Important Files Changed

Filename Overview
docs/devnotes/posts/push-datasets-to-hugging-face-hub.md New dev note covering push_to_hub feature; well-structured with a single marker, full template path, and clear code examples. Previously flagged issues appear resolved.
docs/devnotes/posts/text-to-sql.md Date changed from 2026-03-11 to 2026-04-14 with no other modifications — appears to be an accidental edit that would corrupt the blog index sort order and RSS feed for this previously-published post.
docs/devnotes/.authors.yml Adds co-author davanstrien (Daniel van Strien from Hugging Face) — straightforward and correct.
mkdocs.yml Adds the new post to the nav in the correct most-recent-first position.
docs/devnotes/posts/assets/push-datasets-to-hugging-face-hub/push-to-hub-hero.png New image asset for the blog post.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[DataDesigner.create\nor saved artifact folder] --> B[push_to_hub / push_to_hub_from_folder]
    B --> C1[1. Upload README.md\nauto-generated dataset card]
    C1 --> C2[2. Upload data/*.parquet\nremapped from parquet-files/]
    C2 --> C3[3. Upload images/*\nskipped if no image columns]
    C3 --> C4[4. Upload processor dirs\ne.g. conversations/*.parquet]
    C4 --> C5[5. Upload builder_config.json\nfull pipeline definition]
    C5 --> C6[6. Upload metadata.json\npaths rewritten for HF layout]
    C6 --> D[Live HF Dataset Repo\neach step = one commit]
    D --> E1[Dataset Viewer\nbrowsable immediately]
    D --> E2[Streaming via parquet]
    D --> E3[Round-trip reproduce\nfrom_config HF URL]
Loading
Prompt To Fix All With AI
This is a comment left during a code review.
Path: docs/devnotes/posts/text-to-sql.md
Line: 2

Comment:
**Unintentional date change on existing post**

This PR changes the `text-to-sql.md` publication date from `2026-03-11` to `2026-04-14`. The text-to-sql post itself has no other changes, so this looks like an accidental edit. The MkDocs Material blog plugin uses the frontmatter date for its listing index sort order and RSS feed, so bumping this date would move the text-to-sql post above any other posts dated between March 11 and April 14 in the blog index — misrepresenting the original publication date to readers and feed subscribers.

```suggestion
date: 2026-03-11
```

How can I resolve this? If you propose a fix, please make it concise.

Reviews (12): Last reviewed commit: "fix date for text-to-sql" | Re-trigger Greptile

dhruvnathawani
dhruvnathawani previously approved these changes Feb 26, 2026
Copy link
Copy Markdown
Contributor

@dhruvnathawani dhruvnathawani left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you use AI for the images?
LGTM

Move the single <\!-- more --> to after the intro paragraph for a shorter
blog teaser and remove the 6 redundant markers throughout the post.
@nabinchha
Copy link
Copy Markdown
Contributor Author

Did you use AI for the images? LGTM

@dhruvnathawani, yes!

nabinchha and others added 2 commits March 9, 2026 09:45
Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
davanstrien and others added 4 commits March 30, 2026 09:13
* docs: add HF ecosystem context to push-to-hub dev notes

Add section on what datasets get on the Hub (Dataset Viewer, streaming,
Viewer API), link to Hub search for DataDesigner datasets, and note that
private datasets can be flipped to public.

* Update docs/devnotes/posts/push-datasets-to-hugging-face-hub.md

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

* fix: remove doubled library: prefix in Hub search URL

---------

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Apr 14, 2026

Docs preview: https://5f89bec9.dd-docs-preview.pages.dev

Notebook tutorials are placeholder-only in previews.

dhruvnathawani
dhruvnathawani previously approved these changes Apr 14, 2026
Copy link
Copy Markdown
Contributor

@dhruvnathawani dhruvnathawani left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants