This page has archives. Sections older than 31 days may be automatically archived by Lowercase sigmabot III when more than 4 sections are present.
Wikidata weekly summary #608
Here's your quick overview of what has been happening around Wikidata over the last week.
Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here: What changes would you like to see in the newsletter in 2024?"
Discussions
Open request for adminship: EPIC (RfP scheduled to end after 26 December 2023 20:34 UTC)
New requests for permissions/Bot: Balyozbot. Tasks:
Import sitelinks, labels, descriptions from ku wikipedia pages which use the template w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
Upcoming: Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
Papers: Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs by (Conia et al, 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
Videos
Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi, about Wikidata for African Librarians during the Wiki Indaba conference, that took place between 3-5 November 2023 in Agadir, Morocco.
No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well enable writing information back to source and federating with other linked institutions.
Map of K-Pop Idols --> An interactive map where each red dot represents a K-pop Idol (a singer or musician in South Korean Pop music) you are able to click on.
Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
Tool of the week
Cersei - is a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on eg JavaScript to navigate. It can therefore access data sources that can not be accessed via eg Mix'n'match. The data from sources can be updated regularly, either for everything, or just changed entries (if the source has a "recent changes" equivalent).
Wikidata:Zotero/Cita - is a Wikidata addon for Zotero that adds citations (i.e., what other items an item cites) metadata support to this open source reference management software, using cites work (P2860) information available from Wikidata, and enabling users to easily contribute missing data.
production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
Newest WikiProjects: WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
Upcoming: The next Wikidata+Wikibase office hours will take place on Wednesday, 17:00 UTC, 17th January 2023 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Papers: Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
Videos: WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video Lydia Pintscher takes a moment to review the major events of Wikidata over the past few years. Then turns to look forward and predict what Wikidata's prospects will be over the next year.
Tool of the week
WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
Nonprofit Status (Indicating the legal and tax status of a non-profit organization (specific to served legal areas, aka. Countries). Addition to {{P|1454}}. {{P|1628}} to [https://schema.org/nonprofitStatus nonprofitStatus] from schema.org. Organizations can have multiple Nonprofit Status from different countries.)
creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Blogs: PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
LIS Journals’ Lack of Participation in Wikidata Item Creation by Eric Willey & Susan Radovsky, discusses the gap of Wikidata items being created for scholarly articles by the scholar's themselves and if this can lead to inconsistent or inaccurate data model.
Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
Videos
Map Kerala Initiative is an opendata portal geospatial map powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
Notebooks: Wikipedia article as a timeline - This tool transforms a Wikipedia article in a timeline by parsing all internal links in a Wikipedia article and retrieving the date corresponding to each internal link using the point in time (P585) property in Wikidata.
Tool of the week Map your list of created articles - a notebook display of geolocated articles on a map created by a user per chosen project and batch (featured/good article).
Other Noteworthy Stuff
Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project Software Collaboration for Wikidata prematurely. Read their joint statement here.
IP masking/temporary accounts: We are adjusting Wikibase to be prepared for the upcoming changes to no longer expose IP addresses for non-logged-in users (phab:T351968)
Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (phab:T305660)
Wikibase REST API:
We finished the work on making it possible to get all sitelinks of an Item (phab:T344041)
We are working on getting a sitelink for a given wiki (phab:T344039)
Here's your quick overview of what has been happening around Wikidata over the last week. Translations are available.
Discussions
New request for comments: Domain name as data (Summary: How should Wikidata store the domain name associated with an item? There are many properties for URLs, but a domain name is a different value.)
PLW 2024: Provenance loves Wiki - Fri. 12th - Sun. 14th January. If you missed the event, catch up by reading the slides, Notes and watching the recordings on the Project page
Next: Linked Open Data in Heritage Workshop > Jan. 23rd, 13:00 - 15:00 CET. If you are in the Maastricht University Faculty and want to know enhance heritage research, improve data management, connectivity and visualisation, register for the Workshop.
AskWikidata: Natural language queries to Wikidata, a naive prototype created by Senior Software Engineer for Wikidata, Robert Timm. Want to try? (Google Colab)
IP Masking: We are continuing to adapt Wikibase to the upcoming IP Masking feature. We worked on hiding warnings about IP addresses being saved when they don’t apply (phab:T353807, phab:T352006) and creating temporary accounts when editing (phab:T354730)
Wikibase REST API:
We continued working on the ability to get a sitelink for a given site (phab:T344039)
We started working on the ability to remove a sitelink for a given wiki (phab:T344685)
We worked on fixing a bug where the REST API PUT request does not handle statement on Items with lowercase statement IDs (phab:T352644)
mul language code: We did user testing to find any remaining issue before release
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-01. Please help Translate.
Discussions
New requests for permissions/Bot:
DifoolBot 4 Task(s) - Split single references containing multiple reference URLs into multiple references.
Bot Bozze Task(s) - Add sitelinks to itwiki draft articles after they've been moved to the main namespace.
New request for comments: Spelling convention for labels and descriptions in English - RfC started 2024-06-25. This RfC requests feedback and input for finding consistency in spelling convention as English has multiple regional variations.
Past: The Lexicodays 2024 was an online event designed to offer a discussion space for the Wikidata community about Lexicographical Data. An archive of some of the slides and session recordings are here c:Category:Lexicodays 2024. More will be added as they become available.
Upcoming:
The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 10th July 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Botany-focused Wikidata online workshop online as part of the #IBC2024. Date: Tuesday 9th July at 9pm NZST (GMT+12) / 11 am central Europe. Register here!
Press, articles, blog posts, videos
Blogs
Querying for audio on Wikidata - This blog post discusses using SPARQL queries on Wikidata to find audio recordings, focusing on musical compositions and their associated genres.
Diff Blog: Imagining a Wikidata future for librarians together - the sixth and final blog post from the LD42023 conference. Silvia Gutiérrez (WMF) and Giovanna Fontenelle (WMF) document the results of the collaborative session on building a bridge between the Library-Wikidata community and WMF.
Library Knowledge as Linked Data: A Wikidata Approach: Contributing to a shared data commons. David Erlandson describes the experiences of using Wikidata for the pilot Program for Cooperative Cataloging to "accelerate the movement towards ubiquitous identifier creation and identity management at the network level".
User:Zvpunry/CreateNewItem - This is a User script to easily add a new Item while editing a Statement and noticing that the desired Item is missing.
Other Noteworthy Stuff
The second iteration of the Wikidata:Open Online Course has begun. Class will continue until August 11. Whether you're a beginner taking your first steps, an individual in need of a refresher on Wikidata concepts, or a seasoned trainer looking to level up your skills - this course is right for you.
Newest WikiProjects: Inuktitut - This is the space to organize work to assure that the sum of all knowledge and the supporting infrastructure for necessary services are available in Inuktitut (ᐃᓄᒃᑎᑐᑦ, Inuktitut).
Here's your quick overview of what has been happening around Wikidata in the week leading up to 2024-12-09. Missed the previous one? See issue #656
Discussions
New requests for permissions/Bot: KlaraBot - Task(s): Append a human's lifespan to descriptions when they can be authoritatively sourced.
Closed request for comments: Audio transcription (P9533) - Closed with no consensus. The discussion is ongoing on the Property P5933 talk page.
Events
Past: Amical Wikimedia, the Catalan-language and culture focused thematic Wikimedia Organization organized the Celebrem Wikidata (Let's celebrate Wikidata) project to celebrate Wikidata's 12th anniversary, from November 10 - 30. This included a Wikidata introduction workshop to equip participants with the editing skills to tackle the project's main aim. This was presented as a game to delete duplicate info on Wikidata and Catalan Viquipèdia infoboxes, in three areas: protected buildings, officers' positions and data related to sports teams players. At the end of the event, ~200 Wikidata-fed infoboxes and Wikidata items were improved and many Wikipedia editors edited Wikidata for the first time!
(Deutsch)Wikidata for Legal Historians - Tue. 10 December, 3pm - 7pm (UTC+1). This presentation explores Wikidata as a key platform for LOD, explains its Semantic Web foundation, introduces FactGrid (a Wikidata-based platform for historical research). Highlights potential of both platforms using examples and encourages discussion for legal historical research. Register here.
Today (09.12.2024) is the last chance to submit an Abstract for the Wikidata and Research conference (5 - 6 June 2025). If you are interested in participating, please review the submission acceptance format before submitting here.
Press, articles, blog posts, videos
Blogs
MediaWiki Conference Highlights, featuring Wikibase talks including one by Christos Varvantakis and Jon Amar from Wikimedia Deutschland.
Ten years of Philippine local Govt. data for Wikidata's 12th Birthday. Read about SKAP's (Shared Knowledge Asia Pacific) efforts to add 10 years worth of financial data of local Government assets to Wikidata during a Datathon.
Papers
Developing an OCR - Wikibase Pipeline for Place Names in the RGTC Series - introduces a semi-automated workflow for extracting and digitally storing geographically relevant information, including spatial relations and contextual details, from place names in the Répertoire géographique des textes cunéiformes. By Matthew Ong (2024).
Videos
Wikibase4Research - Kolja Bailly presents ways in which the Wikibase4Research tool by the TIB Open Science Lab supports researchers in dealing with Mediawiki software for knowledge bases such as Wikibase and facilitates better and FAIR Research Data Management. Includes a live demonstration and beginner-friendly instructions.
Tool of the week
CAT🐈: Metrics computing simple metrics (number of labels, number of descriptions, number of sitelinks, number of statements) for item matching a simple claim.
Let's Connect invites you to get involved in helping spread awareness and knowledge of Wikidata, potentially help organise a Wikidata Learning Clinic. Are you interested in participating? Please sign-up on this registration form.
reference illustration (an illustration of this subject to provide a detailed reference for its appearance. It should be ideally tied to the primary literature on the item.)
Showcase Items: Das Erste: A German public service television channel broadcasting for more than 70 years.
Showcase Lexemes: Kerzu (L8153) the Breton word for December, directly translates from "totally black", rather appropriate for the cold, dark last month of the year.
Here's your quick overview of what has been happening around Wikidata in the week leading up to 2024-12-16. Missed the previous one? See issue #657
Discussions
New requests for permissions/Bot: PWSBot - Task(s): Is a selfmade chatbot to answer factual questions as part of a final research project for educational purposes.
Closed request for permissions/Bot: CarbonBot - Withdrawn by submitter
Next Linked Data for Libraries LD4 Wikidata Affinity Group session (Attn: Please fill out Pre-Participation Survey!) 17 December 2024: We have our next LD4 Wikidata Affinity Group Session on Tuesday, 17 December 2024 at 9 am PT / 12 pm ET / 17:00 UTC / 6 pm CET (Time zone converter) Wikimedian Mahir Morshed is leading a series of four sessions focused on lexicographical data in Wikidata. We are looking forward to learning more about these Wikibase entities! If you anticipate attending the workshop sessions, please fill out a brief survey linked from our Series Etherpad to help us prepare relevant materials for you. Sessions will be held on November 5, November 19, December 3, and December 17, 2024 at our regular time of 9am PT / 12pm ET / 17:00 UTC / 6pm CET. Event page
Baptiste de Coulon, "Les données liées, Wikidata et les archives : une opportunité de contribution aux communs numériques". In: La Gazette des archives, n°271, 2024-2, p.37-56 (free access online after 3 years).
Tabular Online Validator - checks if SPARQL query results conform to a provided schema by validating data and highlighting potential errors, such as missing properties, invalid values, or too many values, with the option to refine the schema if issues arise. (A major update to the current ShEx validator that is expected to get integrated into the existing validator soon)
New General datatypes property proposals to review:
About box (Screenshot of the About Box of the respective software (contains important information such as authors, license, version number and year(s) and is included in almost every software))
Wikibase REST API: We prototyped search support for the REST API and would like your feedback on it.
Property Suggestions: We updated the underlying data so you should have more up-to-date suggestions again when making new statements.
EntitySchemas: We continued the work on making it possible to search for EntitySchemas by their label and aliases when linking to them in a statement.
Query Service: We are investigating if we can do something about the issue where not all edgeLabels are shown on a graph visualisation (phab:T381857) and if there are any alternatives to the library used for the graph builder in the Query Service (phab:T381764)
Under the hood: We are optimizing the server setup for the term store to accommodate its growth (phab:T351802)
Ongoing: Wikidata Cleanup 2024 - Romaine continues his initiative, "Wikidata Cleanup," to coordinate community efforts in addressing the problem of items missing basic properties during the last ten days of 2024, when many users have extra time due to holidays. The aim is to improve data quality by focusing on ensuring all items have essential properties like "instance of" (P31) or "subclass of" (P279), adding relevant country and location data, and maintaining consistency within item series.
Upcoming events: Data Reuse Days - online event focusing on projects using Wikidata's data, 18-27 February 2025. You can submit a proposal for the program on the talk page until January 12th.
Press, articles, blog posts, videos
Blogs
Exploring YouTube Channels Via Wikidata, by Tara Calishain. "This time I'm playing with a way to browse YouTube channels while using Wikidata as context. And you can try it too, because it doesn't need any API keys!"
Flying Dehyphenator is an Ordia game. Given the start part of a word, use the spacebar to move the word and hit the next part of the word. Only hyphenations described with the Unicode hyphenation character work.
Want a wrap of your Wikidata activities in 2024? Wiki Year In Review has it for you! (use www.wikidata.org for the project URL)
Other Noteworthy Stuff
Wikibase/Suite-Contributing-Guide: Wikibase Suite's contributing guide has been published. This guide aims to help anyone who wants to contribute and make sure they are equipped with all the relevant information to do so.
New General datatypes property proposals to review:
About box (Screenshot of the About Box of the respective software (contains important information such as authors, license, version number and year(s) and is included in almost every software))
nonprofit tax status (country specific tax status of organisations like non-profits)
Here's your quick overview of what has been happening around Wikidata in the week leading up to 2024-12-30. Missed the previous one? See issue #659
Welcome to 2023’s Final Weekly Summary!
A huge thank you to everyone who contributed to the newsletter this year! 🎉 Each of your contributions, whether big or small, has made a difference and has helped us create a vibrant and informative resource for the Wikidata community. 🙏 Let's continue building and sharing knowledge together in the coming year! 🙌✨
Discussions
Open request for oversight: Ameisenigel (RfP scheduled to end at 6 January 2025 21:52 UTC)
Press, articles, blog posts, videos
Papers
Library Data in Wikimedia Projects: Case Study from the Czech Republic by Jansová, L., Maixnerová, L., & Š´tastná, P. (2024). "The paper outlines the collaboration between the National Library of the Czech Republic and Wikimedia since 2006, focusing on linking authority records with Wikipedia articles and training librarians and users. By 2023, the National Library provided most of its databases under a CC0 license, launched a "Wikimedians in Residence" program, and collaborated on projects involving linked data and using authority records in Wikidata. This partnership has enhanced their cooperation for mutual benefit, identifying key factors for their successful long-term collaboration."
How have you modelled my gender? Reconstructing the history of gender representation in Wikidata by Melis, B., Fioravanti, M., Paolini, C., & Metilli, D. (2024). "The paper traces the evolution of gender representation in Wikidata, showing how the community has moved from a binary interpretation of gender to a more inclusive model for trans and non-binary identities. The Wikidata Gender Diversity project (WiGeDi) timeline highlights the significant changes influenced by external historical events and the community's increased understanding of gender complexity."
Videos: Arabic Wikidata Days 2024 - Data Science Course - First Practical Session: Wikibase-CLI Tool (part 1, part 2) by Saeed Habishan. "The Wikibase-CLI enables command-based interaction with Wikidata using shell scripts and JavaScript. The tool runs on NodeJS and enables automatic reading and editing of Wikidata."
Tool of the week
WikiORA - is a tool designed for gene over-representation analysis. It integrates data from Wikidata, Wikipedia, Gene Ontology, and PanglaoDB to help researchers identify significantly enriched gene sets in their data.
New General datatypes property proposals to review:
About box (Screenshot of the About Box of the respective software (contains important information such as authors, license, version number and year(s) and is included in almost every software))
nonprofit tax status (country specific tax status of organisations like non-profits)
Newest WikiProjects: Uganda - aims to be a central hub for the curation of any and all items (biographical, cultural, geographical, organisational, etc...) relating to Uganda (Q1036)
WikiProject Highlights:
Narration/Folktales - creation of Items for motifs described in Thompson's motif index completed
Austria - concerns itself with improving data from nonprofit organizations in Austria
Showcase Lexemes: ਲੇਟਣ (L750580) - in Punjabi (pa) and "لیٹݨ" in Punjabi Shahmukhi (pnb) transliterate to "Leṭaṇ," which means "to lie down" or "to rest" in English.
Development
Most of the development team staff are still taking a break, so no development happened.
Here's your quick overview of what has been happening around Wikidata in the week leading up to 2025-01-06. Missed the previous one? See issue #660
Discussions
New request for comments: Constraints for Germanies - Following from a property discussion on P17 (German non-states), this RfC aims to find consensus on how to apply constraints that exclude items of historical periods in German history.
Please submit your proposals for the Data Reuse Days online event until January 12th. See current proposals on the talk page and here's some ideas to inspire you: presentations/demos of tools using Wikidata's data (10mins Lightning Talk presentations), discussions and presentations connecting Wikidata editors with reusers and/or explanations and demos on how to use a specific part of the technical infrastructure to reuse Wikidata's data (APIs, dumps, etc.).
Talk to the Search Platform / Query Service Team --January 8, 2025. The Search Platform Team holds monthly meetings to discuss anything related to Wikimedia search, Wikidata Query Service (WDQS), Wikimedia Commons Query Service (WCQS), etc.! Time: 16:00-17:00 UTC / 08:00 PDT / 11:00 EDT / 17:00 CET
The next Wikidata+Wikibase office hours will take place on Wednesday, 17:00 UTC, 15th January 2025 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Blogs: (fr) female authors with male pseudonyms, blog post by Le Deuxième Texte including SPARQL queries to find female authors with male pseudonyms.
Websites :Global Dementia and Risk Factors, website by 'Students at the Maastricht Science Programme', includes data visualizations of the prevalence and current treatments of dementia across the world. It utilises data extracted as SPARQL Endpoints from Wikidata.
Papers
Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema - This paper proposes an ontology-driven approach to KG construction using LLMs where competency questions guide ontology creation and relation extraction, leveraging Wikidata for semantic consistency. A scalable pipeline minimizes human effort while producing high-quality, interpretable KGs interoperable with Wikidata for knowledge base expansion. By Xiaohan Feng, Xixin Wu & Helen Meng (2024).
Knowledge Incorporated Image Question Answering Using Wikidata Repository - Proposes a Visual Question Answering (VQA) model that integrates external knowledge from Wikidata to address complex open-domain questions by combining image, question, and knowledge modalities. Evaluated on the VQAv2 dataset, the model outperforms prior state-of-the-art approaches, demonstrating improved reasoning and accuracy (Koshti et al., 2024).
Videos: (arabic) Part 6: SPARQL Demo Session: connecting external services - Sparql SERVICE clause gives access to additional data such as labels via wikibase:label, interaction with MediaWiki APIs using wikibase:mwapi, and integration of data from subgraphs (such as the main graph and the scholarly articles graph). Integration of data from external SPARQL endpoints such as DBpedia.
Tool of the week
Wikidata Entity Linker - is a Microsoft Edge browser extension that creates web links for matching inner HTML text based on a regex format of Q\d+ which is the format of a Wikidata Entity ID. (email)
Other Noteworthy Stuff
Vacancy: Research Software Engineer / Wikibase-Expert - The Technische Informationsbibliothek (TIB) located in Hannover has a research position open for someone interested in the deployment, administration and maintenance of open source knowledge management software such as Mediawiki, Wikibase and OpenRefine as part of the NFDI4Culture partnership within the OSL.
January 1, 2025, marked Public Domain Day, with hundreds of 1929 films entering the public domain. Sandra has shared helpful notes to assist in making these films discoverable via WikiFlix, by adding video files to Wikicommons and Wikidata. Join the effort!
New General datatypes property proposals to review:
About box (Screenshot of the About Box of the respective software (contains important information such as authors, license, version number and year(s) and is included in almost every software))
nonprofit tax status (country specific tax status of organisations like non-profits)