Releases: sodascience/disease_database
Historical disease database version 1.0.0
This is the first stable version of the disease database. It is created by performing regex searches for locations (municipalities) and diseases in historical newspaper articles from 1830-1940. There is not much post-processing, so note that there are likely still several issues. Please be careful when using this dataset, perform your own quality checks. When in doubt, contact us via GitHub.
Further improvements are scheduled for the next versions, such as limiting the location/disease distance in the article text, more advanced text processing, post-hoc space-time smoothing, and more.
What's Changed
- Database creation pipeline by @vankesteren in #11
Full Changelog: v0.2.0-alpha...v1.0.0
Historical diasease database version 0.2.0-alpha
This is the first trial release. Download the database via the attached zip file. NB: this trial release is NOT production ready, with several issues needing to be solved before this can be used in real-world analyses.
What's Changed
- added pipeline for harvesting data from delpher api; improved the old… by @fqixiang in #3
- updated README.md and added delpher_api/README.md; minor bug fixes by @fqixiang in #4
- added requirements.txt by @fqixiang in #5
- move to pyproject.toml by @vankesteren in #6
- Overhaul folder structure & streamline workflow by @vankesteren in #7
- double-check on EJ's previous PR by @fqixiang in #8
- Add faster query (6 minutes) and uncertainty by @vankesteren in #9
- Produced first images for presentation by @kristinathompson in #10
New Contributors
- @fqixiang made their first contribution in #3
- @vankesteren made their first contribution in #6
- @kristinathompson made their first contribution in #10
Full Changelog: https://github.com/sodascience/disease_database/commits/v0.1