Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Datasets:
nvidia
/
Nemotron-Pretraining-Code-v3
like
13
Follow
NVIDIA
59.2k
Tasks:
Text Generation
Modalities:
Text
Formats:
parquet
Languages:
code
Size:
100M - 1B
Tags:
text
pre-training
human
legal
Nemotron_3_Ultra
Libraries:
Datasets
pandas
Polars
+ 1
License:
cc-by-4.0
Dataset card
Data Studio
Files
Files and versions
xet
Community
main
Nemotron-Pretraining-Code-v3
8.22 GB
3 contributors
History:
2 commits
leannachr
Update README.md
9b42fea
verified
2 days ago
Nemotron-Code-Metadata
initial commit
3 days ago
.gitattributes
Safe
2.5 kB
initial commit
3 days ago
LICENSE
Safe
0 Bytes
initial commit
3 days ago
README.md
Safe
6.68 kB
Update README.md
2 days ago