Skip to content

Issues: NVIDIA/NeMo-Curator

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

fuzzy dedup in cpu enhancement New feature or request
#101 opened Jun 7, 2024 by simplew2011
Fuzzy dedup error if partition wise indices do not start from 0 bug Something isn't working
#48 opened May 2, 2024 by ayushdg
[FEA] Add examples showing how to use both CPU & GPU modules together documentation Improvements or additions to documentation enhancement New feature or request
#65 opened May 15, 2024 by ayushdg
[FEA] Update read_json to work with s3 paths. enhancement New feature or request
#66 opened May 15, 2024 by ayushdg
find_pii_and_deidentify example fails bug Something isn't working
#85 opened May 28, 2024 by randerzander
Fix noisy Dask shutdown
#9 opened Mar 20, 2024 by ryantwolf
Update download documentation to include client creation bug Something isn't working documentation Improvements or additions to documentation
#100 opened Jun 6, 2024 by moutasemalakkad
Remove Numpy<2.0 pin meta General NeMo-Curator maintenance/packaging
#120 opened Jun 18, 2024 by ayushdg
Running into OOM with add id bug Something isn't working
#142 opened Jul 8, 2024 by yyu22
[META] Update python version to include python 3.11 meta General NeMo-Curator maintenance/packaging
#188 opened Aug 6, 2024 by VibhuJawa
Pandas and cuDF DataFrames in DocumentDataset bug Something isn't working
#195 opened Aug 8, 2024 by sarahyurick
[FEA] Add license detector for code repositories enhancement New feature or request
#208 opened Aug 15, 2024 by miguelusque
Grammar and punctuation nits in Jupyter Notebooks documentation Improvements or additions to documentation good first issue Good for newcomers
#228 opened Sep 4, 2024 by sarahyurick
16 tasks
Running Curator under SLURM Cluster
#531 opened Feb 7, 2025 by philm001
ProTip! Follow long discussions with comments:>50.