CRS Entity matching #37

Open
opened 2020-06-27 09:06:24 +02:00 by nickdickinson · 1 comment

We need a way to start matching organizations across different WASH dataset with wikidata. I'm opening this issue do aside from manual matching, we can discuss alternatives and find better solutions. Such as the approach here:
https://github.com/cwrc/wikidata-entity-lookup

We need a way to start matching organizations across different WASH dataset with wikidata. I'm opening this issue do aside from manual matching, we can discuss alternatives and find better solutions. Such as the approach here: https://github.com/cwrc/wikidata-entity-lookup
Author
Owner

I've had good progress working with OpenRefine for assisted manual matching. Some matches are automatic but there is still significant manual effort in the first round. Neat thing is that there is an API and we can control OpenRefine and repeat procedures using Python. It is also well documented/supported in the wikidata space: https://www.wikidata.org/wiki/Wikidata:Tools/OpenRefine/Editing

I've had good progress working with OpenRefine for assisted manual matching. Some matches are automatic but there is still significant manual effort in the first round. Neat thing is that there is an API and we can control OpenRefine and repeat procedures using Python. It is also well documented/supported in the wikidata space: https://www.wikidata.org/wiki/Wikidata:Tools/OpenRefine/Editing
nickdickinson added this to the WASHWeb-2019 project 2023-11-14 10:49:54 +01:00
Sign in to join this conversation.
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
The due date is invalid or out of range. Please use the format 'yyyy-mm-dd'.

No due date set.

Dependencies

No dependencies set.

Reference: WASHWeb/WASHWeb-2019#37
No description provided.