- Notifications
You must be signed in to change notification settings - Fork28
A database of public bodies such as government departments, ministries etc.
License
datasets/publicbodies
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
A database of public bodies (or organizations):
Government-run or controlled organizations or entities which may or may nothave distinct corporate existence
Examples are:
- Government Ministries or Departments
- State-run Health organizations
- Police and fire departments
Visit the site:https://publicbodies.org/
Data is stored in CSVs partitioned by country or region (e.g. EU) in thedata folder. Files are named by two-letter ISO code.
Please just add a CSV file and submit a pull request oropen an issue.
The set of fields required in the CSV file can be seen in the field list on:public-body-schema.json. You can also check outthe existing data indata/
for hints. To learn more about DataPackages, visithttps://specs.frictionlessdata.io/.
If you can,developing a botto automatically and periodically collect the data is even better.
The website is a Jekyll site. To get it running locally:
InstallDocker.
Get the code
git clone https://github.com/okfn/publicbodiescd publicbodies
Run Jekyll
cd websiteexport JEKYLL_VERSION=4.2.0docker run --rm --volume="$PWD:/srv/jekyll" -it jekyll/minimal:$JEKYLL_VERSION jekyll build --baseurl$PWD/_site/ --watch
The built website will appear on the
website/_site
folder.
The list of outstanding issues is at:https://github.com/okfn/publicbodies/issues
Data is kept automatically up-to-date by bots that collect and updatedata once a week. The scripts are kept on thescripts/import directory, followed by the internationalplace code (e.g.br
for Brazil,it
for Italy).
The script MUST be runnable from a command line interface. It shoulddisplay the available options if run with the--help
parameter, andoutput data to the file chosen by the--output
parameter. For example:
python3 scripts/import/br/import_br.py --help
usage: import_br.py [-h] [--output file_name]Imports Brazilian public body data from the official source and complements itwith data from several auxiliary sources. Official source: [SIORG's open dataAPI](https://dados.gov.br/dataset/siorg)optional arguments: -h, --help show this help message and exit --output file_name filename for the data output as CSV
When making requests, bots MUST use the Public Bodies Bot user agent stringto identify themselves to servers:
PublicBodiesBot (https://github.com/okfn/publicbodies)
If using Python, use the same libraries already defined inscrips/requirements.txt, in order to keepthe project dependencies tidy, and only add new ones if strictlynecessary.
After creating a new bot, make sure to add it to theupdate data workflow so that it runsregularly and keeps the data up-to-date.
Details of the automated data extraction to build the original database.
Data sources:
- Brazil
- Brazilian Government's SIORG –https://dados.gov.br/dataset/siorg
- European Union
- Italy
- Opendata IPA(amministrazioni/enti)
- Germany
- FragDenStaat.de – (private GoogleDoc)
- Bund.de –https://www.bund.de/Content/DE/Behoerden/Suche/Formular.html
- United Kingdom
- WhatDoTheyKnow.com –https://www.whatdotheyknow.com/body/all-authorities.csv
- United States of America
- A-Z Index of U.S. Government Departments and Agencies –https://www.usa.gov/federal-agencies/a