Tag Archives: wikidata

Wikidata Class Hierarchy

The latest update to the wd-extract.py program in my wikidata project (see https://github.com/jimbelton/wikidata for the code) adds the ability to extract the class hierarchy from a Wikidata dump file. I’ve also added a new program, wd-diagram.py, that can render the …

Posted in programming

Wikidata Anomalies

I’ve found some interesting anomalies in the latest Wikidata dump (from 2016-02-15). Two properties are referenced by objects but are never defined. The numbers in parentheses are the line numbers of the objects in the dump that reference the …

Posted in programming | 2 Comments

Wikidata Extraction Tool on GitHub

I just pushed the first bit of a tool for extracting information from wikidata.org. See my previous article for a description of the dump format. The tool can be found at https://github.com/jimbelton/wikidata. It doesn’t do a lot so far, but …

Posted in programming

Wikidata JSON Dump File Format

Data dump files: Weekly dumps of the entire Wikidata database can be downloaded from http://dumps.wikimedia.org/other/wikidata/. JSON (JavaScript Object Notation) is the recommended format. JSON dump files are named YYYYMMDD.json. File format: The file is encoded as a single list (i.e. …

Posted in programming, wikidata
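
As a rough illustration of the "single list" dump format described in the excerpt above, here is a minimal Python sketch that reads a dump one entity at a time instead of loading the whole list into memory. It rests on an assumption the truncated excerpt does not confirm: that the opening and closing brackets sit on their own lines and each entity occupies exactly one line. The function name iter_entities and the two-entity sample are made up for this example and are not taken from wd-extract.py.

```python
import io
import json


def iter_entities(lines):
    """Yield one decoded entity per line of a dump-like stream.

    Assumption (not confirmed by the excerpt): the list brackets are on
    their own lines, and every entity is a single line that ends with a
    comma, except for the last one.
    """
    for line in lines:
        text = line.strip().rstrip(",")
        if text in ("", "[", "]"):
            continue  # skip the list brackets and any blank lines
        yield json.loads(text)


# Hand-made sample standing in for a real YYYYMMDD.json file.
SAMPLE = io.StringIO(
    '[\n'
    '{"id": "Q1", "type": "item"},\n'
    '{"id": "P31", "type": "property"}\n'
    ']\n'
)

entity_ids = [entity["id"] for entity in iter_entities(SAMPLE)]
print(entity_ids)
```

With a real dump, the same function could be given an open file object in place of the sample. If the assumed one-entity-per-line layout does not hold, json.loads will raise an error on the first line that is not a complete JSON value, so the mismatch shows up immediately.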