- Notifications
You must be signed in to change notification settings - Fork35
Dataset of Rust source code repositories
License
rust-lang/rust-repos
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
This repository contains a scraped list of all the public GitHub repos with source codewritten in theRust programming language. The source code for the scraper isalso included.
Everything in this repository, unless otherwise specified, is released underthe MIT license.
To run the scraper, execute the program with theGITHUB_TOKEN
environmentvariable (containing a valid GitHub API token -- no permissions are required)and the data directory as the first argument:
$ GITHUB_TOKEN=foobar cargo run --release -- data
The scraper automatically saves its state to disk, so it can be interrupted andit will resume where it left. This also allows incremental updates of the list.
The data is available in thedata/github.csv
file, in CSV format. That filecontains the GitHub GraphQL ID of the repository, its name, and whether itcontains aCargo.toml
andCargo.lock
.
All the repositories contained in the dataset are marked as using the languageby GitHub. Some results might be inaccurate for this reason.