Monday, 6 June 2016

data - Retrieving Wikipedia's coordinate information?


I'd like to add icons to my map showing locations which have Wikipedia entries.


The WikiLocation API is a great resource for finding Wikipedia entries near an XY location, but its limits (a 20 km search radius and at most 50 points per query) make it unsuitable for my purposes.


Ideally I need the map to show ALL locations with Wiki entries, so I guess I'll need to download the Wikipedia dump file and parse it for geocoded entries.


Where is this information found in the Wikipedia dump files?



The Wiki database may be downloaded from http://en.wikipedia.org/wiki/Wikipedia:Database_download but I can't see which file(s) I need to search for the coordinates.


For example, the entry for Sydney includes:


Coordinates:    33°51′35.9″S 151°12′40″E

Where is this found in the dump files? Another way to phrase the question is "how can I determine which Wiki entries are geo-coded?"
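For plotting on a map, the degrees/minutes/seconds value shown above has to be turned into signed decimal degrees. A minimal Python sketch of that arithmetic follows; the function name `dms_to_decimal` and the regular expression are illustrative assumptions, not part of any Wikipedia tool, and the pattern only covers the display format quoted above.

```python
import re

# Matches one component such as 33°51′35.9″S; minutes and seconds are optional.
# This pattern is an assumption based on the display format quoted above.
DMS_PATTERN = re.compile(
    r"(\d+)°\s*(?:(\d+)′)?\s*(?:(\d+(?:\.\d+)?)″)?\s*([NSEW])"
)

def dms_to_decimal(text):
    """Convert a string like '33°51′35.9″S 151°12′40″E' to (lat, lon)."""
    values = {}
    for deg, minutes, seconds, hemi in DMS_PATTERN.findall(text):
        value = int(deg) + int(minutes or 0) / 60 + float(seconds or 0) / 3600
        if hemi in "SW":  # south and west are negative by convention
            value = -value
        values["lat" if hemi in "NS" else "lon"] = value
    return values["lat"], values["lon"]

lat, lon = dms_to_decimal("33°51′35.9″S 151°12′40″E")
print(f"{lat:.6f}, {lon:.6f}")  # roughly -33.859972, 151.211111
```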



Answer



See the database dump section on http://de.wikipedia.org/wiki/Wikipedia:WikiProjekt_Georeferenzierung/Wikipedia-World/en. That page also offers the option of a PostGIS connection.


There was also a CSV download option at one time but that seems to have been removed sometime between 2011 and 2016.


Also look through http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Geographical_coordinates for other options.
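If you do end up parsing article text from a dump yourself, one possible approach is to scan the wikitext for `{{coord|...}}` templates. The Python sketch below assumes coordinates are written with that template in either decimal or degrees/minutes/seconds form; the helper names `parse_coord` and `find_coords` and the sample string are hypothetical, and templates nested inside the coordinate parameters or infobox-specific latitude/longitude fields are not handled.

```python
import re

# Assumption: coordinates appear as {{coord|...}} with no nested templates inside.
COORD_TEMPLATE = re.compile(r"\{\{\s*coord\s*\|([^{}]*)\}\}", re.IGNORECASE)

def parse_coord(params):
    """Turn the parameters of one coord template into (lat, lon), or None."""
    numbers, result = [], []
    for part in (p.strip() for p in params.split("|")):
        if part.upper() in ("N", "S", "E", "W"):
            if not numbers:
                return None
            # degrees + minutes/60 + seconds/3600
            value = sum(n / 60 ** i for i, n in enumerate(numbers))
            result.append(-value if part.upper() in ("S", "W") else value)
            numbers = []
            if len(result) == 2:
                return tuple(result)
        else:
            try:
                numbers.append(float(part))
            except ValueError:
                break  # reached non-numeric options such as display=title
    if not result and len(numbers) == 2:
        return numbers[0], numbers[1]  # plain signed decimal form
    return None

def find_coords(wikitext):
    """Yield (lat, lon) for every coord template that could be parsed."""
    for match in COORD_TEMPLATE.finditer(wikitext):
        coords = parse_coord(match.group(1))
        if coords is not None:
            yield coords

# Hypothetical wikitext fragment, for illustration only.
sample = (
    "{{Infobox|coordinates = {{coord|33|51|35.9|S|151|12|40|E|type:city}}}} "
    "text {{coord|-33.86|151.21|display=title}}"
)
for lat, lon in find_coords(sample):
    print(round(lat, 6), round(lon, 6))
```

A regular expression is enough for a first pass, but a proper wikitext parser would be more robust against the many ways templates are written in practice.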

