Here's your quick overview of what has been happening around Wikidata over the last week.
Discussions
Closed request for permissions/Bot: TiagoLubianaBot 2 (Task: The Open Targets Platform is an important biomedical resource that makes its data available under a CC0 license waiver. The request at hand is to extend the permissions of User:TiagoLubianaBot to import information from Open Targets, initially with a focus on physical interactions between proteins and drugs.)
Ongoing: Weekly Lexemes Challenge #107, Teeth (Challenge started on 2023-08-28 12:01:31)
Upcoming: Next Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour August 28, 2023. We will cover adding the articles from our bibliography of diverse LIS resources that we created in our last session to Wikidata! We will learn how to use Zotero, the reference management software, and its browser plugin to automatically extract metadata for articles and convert them to a Wikidata format that can be batch uploaded using QuickStatements. To get a sneak preview of what we will be doing, you can learn more about the Wikidata & Zotero link here: Wikidata:Zotero. This session will be recorded and the recording shared on the event page. Event page
Keynote about Wikidata for the Chinese Confernence on Knowledge Graphs and Semantic Computing 2023, by Denny Vrandečić - YouTube
How to fill an Infobox on Wikipedia using Wikidata - Wikipedia For Beginners - YouTube
The impact of Wikidata-powered inboxes on minority and low-resourced language Wikipedias in Africa (Wikimania 2023) - YouTube
How to create item in Wikidata about a village (in Malayalam) - YouTube
Add Wikidata link to an OpenStreetMap relation (in Guarani) - YouTube
WikiDBs: A Corpus Of Relational Databases From Wikidata - YouTube
Tool of the week
Wikitrivia is a web-based game that challenges your knowledge of historical events, people, and places. The game is a combination of Sudoku and Scrabble, and all the data used in the game is sourced from Wikidata and Wikipedia. The objective of the game is to place the cards on the timeline in the correct order.
Other Noteworthy Stuff
Biyanto Rebin is joining the Software Communication team (SCoT) at Wikimedia Deutschland for 2.5 months as an intern. Welcome!
simule (the element imitates or makes the value of the property appear real)
sexe ou genre du protagoniste (regarding the work in question, For human: male, female, non-binary, intersex, transgender female, transgender male, agender. For animal: male organism, female organism. Groups of same gender use subclass of (P279))
unofficial name (name which is widely or rarely used, but not official (for nicknames use P1449, for pseudonymes P742))
Here's your quick overview of what has been happening around Wikidata over the last week.
Discussions
Open request for adminship: FlyingAce (RfP scheduled to end after 8 September 2023 23:32 UTC)
New request for comments: Must 'Serious' WikiData sources be selective? (The disagreement regards the interpretation of WD:N rule number two, which states an item is acceptable if: "It refers to an instance of a clearly identifiable conceptual or material entity that can be described using serious and publicly available references.")
Job opening: Software Engineer (Wikidata) (m/f/d). For our Team Wikidata, based in Berlin, we are looking for a permanent Software Engineer (m/f/d), full-time or part-time (min. 32h/week), as soon as possible.
Wikidata's eleventh birthday is around the corner! One of our traditions is to prepare birthday presents to the community. Now is a great time to start preparing yours!
has heir or beneficiary (people or organizations that received, or will receive, (all or part of) the subject's property or title)
towing capacity (the maximum sustainable force with which this vehicle can pull or push another object. Alternatively for road- or track-vehicles, the maximum weight of an object on wheels that this vehicle can reliably and safely pull given usual slopes)
learning outcome (specific knowledge, skills, and abilities that students are expected to acquire as a result of participating in a particular education program)
tribe (recognised membership in a society, mainly denoted by shared cultural heritage; for ethnicity use P172)
Peppercat is a website listing government ministers, and other key political leaders, from all over the world, taking data from Wikidata. More information
state of transmission (state of transmission of a work (as a concept, not as a physical item; for that, use P5816))
describes actor of (predicate sense whose actor (an agent or a cause—basically the instigator of an action) is denoted by this sense)
describes undergoer of (predicate sense whose undergoer (a patient, a theme, or a recipient—basically a non-instigator of an action) is denoted by this sense)
Machine learning: We are migrating some tools that currently use ORES to the new Lift Wing (phab:T343731)
EntitySchemas: We are continuing the work around the new datatype to link to EntitySchemas in statements
Query Builder: We fixed some issues in the Query Builder language selector (phab:T344231)
We’re making the warning for anonymous editors more useful by letting them return to the page they came from after logging in (phab:T330550); we’re also working on showing those warnings in the first place in WikibaseLexeme (phab:T343979)
Wikibase REST API: We are working on the ability to remove a statement from a Property (phab:T342976) and get the labels of a Property (phab:T342977)
As part of the changes for the Better diff handling of paragraph splits wishlist proposal, the inline switch widget in diff pages is being rolled out this week to all wikis. The inline switch will allow viewers to toggle between a unified inline or two-column diff wikitext format.[16]
The Enterprise API is launching a new feature called "breaking news". Currently in BETA, this attempts to identify likely "newsworthy" topics as they are currently being written about in any Wikipedia. Your help is requested to improve the accuracy of its detection model, especially on smaller language editions, by recommending templates or identifiable editing patterns. See more information at the documentation page on MediaWiki or the FAQ on Meta.
Next Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour September 15, 2023. We will learn how to use Zotero, the reference management software, and its browser plugin to automatically extract metadata for articles and convert them to a Wikidata format that can be batch uploaded using QuickStatements.More information on the event page
chronological designation (as a typical instance, the stated scholarly journal year to which reference is made by: the reference source being cited to support the statement being made)
We are planning the migration of some of our existing components from the Wikit to the Codex design system in Query Builder, Mismatch Finder and the Special:NewLexeme page.
Warnings about not being logged-in will now have a returnto= parameter attached to their links, so that you can don’t use your flow from logging in (phab:T330550)
We fixed an issue with the LanguageSwitcher in Query Builder where it would open out of the viewport on some tablet screen widths (phab:T344231)
Wikibase REST API: We finished the work on making it possible to remove a statement from a Property (phab:T342976) and getting the labels of a Property (phab:T342977)
WikidataCon 2023, the conference dedicated to the Wikidata community, is taking place on October 28-29, online all around the world and onsite in Taipei. You can now register for the conference.
Next Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour September 29. This event is part of a series where you can gain hands-on experience with Wikidata by working on a diverse library and information science (LIS) dataset. In this fourth session, we'll introduce the Wikimedia PAWS environment for data gathering and processing in your Wikidata projects. We'll focus on web scraping for article data using Python and the Beautiful Soup package for parsing. You'll learn about data models for making your data accessible to both machines and humans. This session will be recorded and shared on the event page. Event page.
Ongoing: Weekly Lexemes Challenge #110, Ohm's law (Challenge started on 2023-09-25 12:01:32)
User:Luca.favorido/linkypop.js is a script that can be used to search an identifier on an external site. It provides a button to search for an identifier as soon as you type it in the property input field. For example, if you type “ORCID”, an icon with a lens will appear, and when you click it, a new tab will open with the ORCID site looking for the name of the researcher. You can then copy the URL and paste it to Wikidata.
Other Noteworthy Stuff
Want to play a game? Wikidata:Games has some Wikidata-related ones for you.
Citation.js Toolforge is a service to export citations from Wikidata items in various formats (BibTeX, RIS) and various citations formats (Vancouver, APA, etc)
type of musical notation (system of musical notation used on a given music source or composition)
chronological designation (stated scholarly journal year to which reference is made by: source cited in support of a particular statement may not be the same as the publication date or volume)
We worked on making it possible to upload mismatches on qualifiers (phab:T313467)
We are continuing to migrate tools from ORES to Lift Wing (phab:T343731)
We've added Wikifunktions as a new wiki for sitelinks (phab:T342857)
Wikibase REST API: We are working on making it possible to get labels, descriptions and aliases from a Property as well as modify the description of an Item
January 12 - 14, 2024, Berlin, Germany (and online) - CFP: Provenance Loves Wiki - A workshop on art history, art science and provenance research in Wikidata/Wikipedia & Wikibase
Ongoing
Weekly Lexemes Challenge #110, Ohm's law (Challenge started on 2023-09-25 12:01:32)
This Week
Next Linked Data for Libraries LD4 Wikidata Affinity Group call October 3, 2023: Lars Willighagen will discuss on citation.js.org, Wikidata, and plans for more linked data. Agenda
Wikidata for Better Health (in French) - Houcemeddine Turki, Faculty of Sciences SFAX < closing session of "Adapting Wikidata to support clinical practice using Data Science, Semantic Web and Machine Learning"
Swiss Archives is an interactive overview map of the Swiss archives present in Wikidata have corresponding Wikipedia articles and in which language (FR, DE, IT), allowing interested Wikimedians to know where they can contribute or expand. - Michael Gasser (X post)
WikiProject GLAM-BW - a project to connect major collections held by museums in Baden-Württemberg by uploading information on collectors, former collection locations, collecting histories, and objects
Development
Wikibase REST API:
We are finalizing the work on getting the labels, descriptions and aliases of a Property.
We are finishing work on modifying the descriptions of an Item.
You can now add sitelinks to Wikifunctions (phab:T342857)
We improved the “required” marker on Special:NewLexeme, hopefully making its meaning clearer (phab:T322683)
If you are not logged-in, you’ll also get the yellow warning when editing Lexeme’s Lemmas, Forms, and Senses (phab:T343979)
We added the tlh-latn and tlh-piqd codes for monolingual text, so that now you can add the titles to Shakepear’s works in the original Klingon (phab:T286239)
Mismatch Finder:
Failed uploads now no longer offer to download review results (phab:T335864)
We are working on the ability to report mismatches on qualifiers (phab:T313467)
The Mismatch Finder will show a clarifying message when Java Script is disabled (phab:T343344)
The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC, 18th October 2023 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Next Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour October 13, 2023: Over the summer and into the fall the LD4 Wikidata Affinity Group will be offering a series of Wikidata Working Hours to give folks an opportunity to try out various Wikidata-related skills and tools by assembling a data set of diverse library and information science (LIS) materials (articles, conference proceedings, books) and adding it to Wikidata. Wikidata Working Hours provide hands-on Wikidata experience in a supportive space. We hope you will join us if you are interested in learning more about Wikidata, exploring LIS literature, and have been looking for a fun Wikidata project to contribute to. The fifth Wikidata Working Hour in the series will cover manually adding authors and publishers from our bibliography into Wikidata. This session will be recorded and the recording shared on the event page. Event page
Is AI lying to us? These researchers built an LLM lie detector of sorts to find out - Tiernan Ray, ZDNET ("Step one is to come up with a list of over 20,000 questions and known answers, drawn from sources such as Wikidata, for which the large language model, in this case, OpenAI's GPT-3.5, can be reliably expected to provide the correct answer.")
Data Scraping, Gathering & Annotation (in Ukrainian VO, English text) < lecture on data gathering, Wikidata, SPARQL and structured data from Ukrainian Catholic University
Wikidata: first pragmatic approach - Ismael Olea - OpenSouthCode 2023 < an introduction to Wikidata - the objective of this talk is for the public to leave amazed and addicted to Wikidata.
Tool of the week
https://aletheiafact.org <- is a new fact-checking website using Wikidata for people/concept identification. The website allows users to contribute to fact-checking claims made by public figures, such as politicians, celebrities, and influencers.
Wikibase REST API: We worked on the new routes for PATCHing Property and Item aliases as well as PUTing Property labels and descriptions (phab:T342982, phab:T337371, phab:T337371, phab:T348150)
We are continuing to work on fixing an issue with Lexeme pages missing styles and scripts (phab:T344362)
We’re adding some missing license notes to some javascript UI interfaces (phab:343998, phab:T343999)
Mismatch Finder: We are continuing the work on supporting mismatches for data that is stored in qualifiers (phab:T313467)
WMF GLAM report: Wikisource Loves Manuscripts, ICOM outreach, Flickr Foundation partnership, OpenRefine adoption, new sources in The Wikipedia Library, Image Description Month events, and the GLAM Wiki Conference
Next Linked Data for Libraries LD4 Wikidata Affinity Group call October 17, 2023: Our scheduled speaker had to unexpectedly cancel, so please join us to transfer group call notes from the Google Drive to the wiki project page (see instructions link in agenda document!) Agenda
The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC, 18th October 2023 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Wikidata Day NYC '23 Join fellow Wikidata-enthusiasts on October 29th at the Butler Library, 114th Street West, NYC for a free Wikidata-celebration with a selection of engaging workshops and presentations to attend.
State of the Map EU 2023 has a session on OSM-Wikidata Map Framework on Sunday, 12th Nov., taking place in Antwerp, Belgium.
Wikidata Days: WM-Peru (in Spanish) attend a 2-day event, with sessions devoted to Wikidata, OpenRefine, QuickStatements and SPARQL.
Wikimedia Chile: Wikidata Training Course Staying in South America, Chilean and all Spanish-speakers can attend a 4-day Course and earn a participation Certificate. Online event - 17 to 20th October.
Press: Indexing news with AI via BroadCast Pro ME ("We are able to create a rich multilingual archive, leveraging translations based on the Wikidata knowledge graph.")
Drag'n'Drop Gadget <-- This handy gadget allows you to click and drag information from a Wikipedia article and add it to a Wikidata-item as a Statement. By Magnus Sälgö.
LIVE Wikidata editing #110 <-- Ainali and Abbe98 do some live editing on Wikidata (in English), and discuss the thought process of what we are doing and why we do it.
User:Nikki/ChecksumCheck.js <-- This script displays a symbol after external identifiers which contain checksums, to indicate whether the checksum in the identifier is correct. Currently only supporting a small number of Identifiers.
Community Configuration 2.0 is a feature that will enable Wikimedia communities to easily customize and configure features to meet their unique needs. This approach provides non-technical moderators with more independence and control over enabling/disabling and customizing features for their communities.
Initial designs are drafted for two different approaches (see images). We will soon demo interactive prototypes to interested admins, stewards, and experienced editors (T346109). Please let us know if you have feedback on the design approach, or want to participate in prototype testing.
IP Masking
The Growth team has been working on several updates to ensure Growth maintained features will be compatible with future IP Masking changes. This work has included code changes to: Recent Changes (T343322), Echo notifications (T333531), the Thanks extension (T345679) and Mentorship (T341390).
Before December, the Growth team will initiate community discussions with the goal of migrating communities from Flow to DiscussionTools. This move aims to minimize the necessity for additional engineering work to make Flow compatible with IP Masking. (T346108)
Mentorship
We assembled some resources for mentors at Mediawiki wiki. This resource page is translatable and will be linked from the mentor dashboard.
We are working to resolve a bug related to mentors properly returning after being marked as "Away". (T347024)
We continue the deployment of the structured task "add a link" to all Wikipedias. We plan to scale the task to all Wikipedias that have link suggestions available by the end of 2023.
We plan to scale the new Impact Module to all Wikipedias soon, but first we are investigating a bug with the job that refreshes the Impact Module data. (T344428)
At some wikis, newcomers have access to the "add an image" structured task. This task suggests images that may be relevant to add to unillustrated articles. Newcomers at these wikis can now add images to unillustrated articles sections. (T345940) The wikis that have this task are listed under "Images recommendations" at the Growth team deployment table.
Other news
We disabled the “add an image” task temporarily (T345188) because there was a failure in the image suggestions pipeline (T345141). This is now fixed.
After a 2.5 years-long collaboration with Bangala Wikipedia, we have decided to start a collaboration with another wiki. Swahili Wikipedia is now a pilot wiki for Growth experiments.
Edit-A-Thon for LGBTQ+ History Month < Binghamton University Library, New York, Oct. 25th. Help make visible the important contributions of queer figures.
Wikidata Birthday Edit-A-Thon (Albanian) < Albanian language Wikimedians celebrate Wikidata's Birthday with a 2-day event (Oct. 28 - 29, 2023) near the "Aleksandër Xhuvani" University, Elbasan.
Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour October 23, 2023: Over the summer and into the fall the LD4 Wikidata Affinity Group will be offering a series of Wikidata Working Hours to give folks an opportunity to try out various Wikidata-related skills and tools by assembling a data set of diverse library and information science (LIS) materials (articles, conference proceedings, books) and adding it to Wikidata. Wikidata Working Hours provide hands-on Wikidata experience in a supportive space. We hope you will join us if you are interested in learning more about Wikidata, exploring LIS literature, and have been looking for a fun Wikidata project to contribute to. The sixth Wikidata Working Hour in the series will cover batch creating items using OpenRefine.This session will be recorded and the recording shared on the Event page
4th Wikidata Workshop as part of the International Semantic Web Conference. 7th Nov., Athens, Greece.
Live Editing #110 < Develop your SPARQL query-building skills in this follow-along session.
Tutorial on Wikidata for or WikiConnect course (in Portuguese) <-- Explaining the Wiki Movimento Brasil's WikiConecta Wikidata course. The course entails Understanding what Wikidata is and its operating logic, Learning the basics of editing and recovering data, and Understanding why and how to use Wikidata with your students.
Birthday Presents from Data Engineering and Semantics Research Unit:
MedCYN as an intuitive web tool for Wikidata-based clinical decision support.
MeSH2Wikidata as an approach for validating and classifying biomedical relations in Wikidata based on MeSH Keywords of PubMed scholarly publications.
Tool of the week
User:Magnus Manske/annas archive.js is a userscript that automatically links to Anna's Archive from Wikidata items for books and research articles, for title, DOI, ISBN, etc. so that people can easily get access to them.
Other Noteworthy Stuff
There is a new update relative to the Wikidata Query Service scaling of the backend, that explains how the team will experiment with splitting the Wikidata Query Service graph and use federation for the queries that need access to all subgraphs.
Mismatch Finder tool improvements: In the next deployment scheduled for November 1, the tool will let you report mismatches on qualifiers in addition to the main part of a statement.
We are preparing for WikidataCon and the Data Modelling Days.
We are looking more into where Wikibase needs to be adapted to the upcoming IP Masking changes.
We added a notification about the license to all edits to labels, descriptions and aliases that was missing still (phab:T343998) The same for Lexemes is coming next (phab:T343999)
Here's your quick overview of what has been happening around Wikidata over the last week.
Welcome to the 600th Weekly Summary!
Lydia initiated the weekly newsletter at the start of the Wikidata project, before it even went live, to keep the community in the loop about the developments, the new projects and tools. Léa carried on the newsletter in 2016 and then it was my turn in 2020. The newsletter has been going strong for eleven years, with its content powered by the community, and delivered every week without fail. Thank you to everyone who helped fill in the different sections of the Weekly Summary thus far ❤️ --Mohammed
Today it is time to celebrate Wikidata’s 11th birthday. Let’s take a look back at the past year and what’s coming.
There are now over 12.200 amazing people who are actively editing on Wikidata - 3000 of them even making more than 100 edits a month ♥️ Thank you! Without you Wikidata wouldn’t be what it is today. Thank you for helping give more people more access to more knowledge every single day. This year also marks the year we can welcome a new sister to the Wikimedia projects: Wikifunctions is live, letting us all geek out on functions in anticipation of Abstract Wikipedia. Another big milestone was Wikibase Cloud coming out of private beta. Now you can more easily run your own Wikibase and collect and maintain data that doesn’t fit into Wikidata. 2023 was also the year when the efforts of Wikidata editors were recognized across the Wikimedia movement with the awarding of the Wikimedian of the Year award to Taufik Rosman and the Wikimedia Laureate award to Siobhan Leachman. Over the coming year I want us to find ways how we can bring more people, who are already editing a bit here and there, closer to the community and help them find their place in our community. If you are one of them and have not found your place yet, check out a WikiProject related to your interests. Wikidata has something for everyone 😉
Wikidata now has over 106 Million Items and nearly 1.2 Million Lexemes. We are closing in on 2 Billion edits, making about 20 Million edits per month. All this content is used to create useful, quirky, educational or just fun applications that wouldn’t be possible without Wikidata and all the work you put into it. Check out Notable People for example. Over the coming year we want to make that even easier by building better APIs, lessening the strain on the Query Service, doing more outreach to developers as well as making our data more usable by ironing out ontology issues. In addition there will be increased focus on improving how the other Wikimedia projects integrate Wikidata. With the opening up of Wikibase Cloud we will hopefully also see many new Wikibases pop up that cover more specialized data or be used as playgrounds to prepare data for Wikidata. I am looking forward to a growing Wikibase Ecosystem and excited about more Linked Open Data becoming available to the world, with Wikidata being an entryway to it all.
And last but not least: if you want to learn a bit more about the history and backstory of Wikidata, then you might like Wikidata: The Making Of by Denny, Markus and me.
List of presents gathered by the community for Wikidata's eleventh birthday
Luthor is a multi-lingual tool for adding usage examples to lexemes on Wikidata, from sentences found on Wikisource in the same language. (by Asaf Bartov)
সংকলক একটি সরঞ্জাম যেটা দিয়ে উইকিসংকলনের লেখাগুলির উইকিউপাত্ত আইটেম অনুসারে সে লেখাগুলিকে উন্নত ভাবে অনুসন্ধান করা যায়। - Sangkalak is a tool with which Wikisource works can be searched more readily, using the Wikidata items for those works. (present from Mahir256) (by মাহির২৫৬-এর উপহার)
Creating new Lexemes? For a lot more languages you now no longer need to provide a spelling variant when creating a new Lexeme, making it even easier to contribute data about words in your language. (present from the development team)
The Mismatch Finder, the tool to help review mismatches between Wikidata and other data sources now also has support for mismatches on qualifiers. This allows it to be useful also for issues that are in the data in qualifiers. (present from the development team)
Upcoming: Next Linked Data for Libraries LD4 Wikidata Affinity Group call October 31, 2023: We will hear from Darnelle Melvin, Cory Lampert, and Andre Hulet, University of Nevada-Las Vegas Libraries, on WireframeVG: A Search and Discovery Application for Wikidata Projects. Agenda
How cultural institutions use Wikidata (How cultural heritage institutions sharing their collection data, including implementing Wikidata projects, batch uploading datasets to Wikidata, and how to share successes to a broader audience) - Jackie Rubashkin, Metadata Technician, Barack Obama Presidential Library; Michelle van Lanschot, Project Coordinator at Wikimedia-Netherlands; William Blueher, Associate Museum Librarian at the Metropolitan Museum of Art and Will Kent, Wiki Education
(in Chinese) Wikidata基礎編輯教學 (Wikidata basic editor) - Conference for Open Source Coders, Users & Promoters (COSCUP) 2023
Notable People is a map project by Topi Tjukanov that showing birthplaces of the most "notable people" around the world. It uses the combined data of Wikipedia and Wikidata from the paper's "a cross-verified database of notable people, 3500 BC-2018 AD" by Morgane Laouenan, Palaash Bhargava, Jean-Benoît Eyméoud, Olivier Gergaud, Guillaume Plique & Etienne Wasmer. The data shows only one person for each unique geographic location with the highest notability rank.
Wikimedia Research Fund Update update. "You can apply for research funds (USD 2K-50K) until December 15, 2023. While all research proposals related to Wikimedia projects are welcome, we particularly encourage research studies on medium to small size languages and communities, as well as in low resourced languages and projects."
The Wikidata For Wikimedia Projects team is investigating the different ways Wikidata is used in the sister projects. We would like to speak with you about your experiences integrating or connecting Wikidata, if you'd like to tell us, please sign up for an interview on our project page or on our Registration Form.
Wikidata:WikiProject Academic Publisher - a project to display open access shares (among others based on publishers) at the Austrian Datahub for Open Access Negotiations and Monitoring
Development
Wikibase REST API: We are continuing the work on making it possible to remove a description in a given language from an Item, modify the label of an Item and add aliases to an Item (phab:T342986, phab:T342980, phab:T335842)
Lexicographical data: We switched the Property that is used to pre-select the spelling variant on the Special:NewLexeme page from P218 (ISO 639-1 code) to P305 (IETF language tag) to make use of the latter’s larger coverage. (phab:T348923)
EntitySchemas: We are continuing to work on addressing the feedback from the testing of the new datatype in the test system.
Upcoming: Data Modelling Days, November 30-December 2. You can propose a session until November 19. If you are running another event or meetup and would like to connect it to the Data Modelling Days, you can add it to the satellite events section.
Knowledge Graphs and Semantic Web is a collection of preceedings from the 5th Iberoamerican Conference and 4th Indo-American Conference, KGSWC 2023, Nov. 13–15. By F. Ortiz-Rodriguez, B. Villazón-Terrazas, S. Tiwari & C. Bobed.
linguistic family of place name (Relates directly a placename to its original language family. It's not the language in which the toponym is written, but the language from which the word (place name) comes from.)
according to (to be used together with P248 if the statement is taken from an aggregator rather than directly from the source)
counterexample (qualifier for deprecated P279 statements; example instance or subclass of the item class for which a "subclass of" statement does not hold (alias: "item belonging to the subject class but not the value class"))
Newest WikiProjects: Source Reliability "is an effort to identify and aggregate online sources of assessments of the reliability and credibility of sources".
Development
Wikibase REST API:
We have implemented the endpoints for PATCH /entities/items/{item_id}/aliases (phab:T337371), PATCH /entities/properties/{property_id}/aliases (phab:T342982) and PATCH /entities/properties/{property_id}/labels (phab:T342980)
We started working on the new endpoints for POST /entities/items/{item_id}/aliases/{language_code} (phab:T335842) and DELETE /entities/items/{item_id}/descriptions/{lang_code} (phab:T342986)
Language codes:
We made Special:NewLexeme more likely to guess the spelling variant for you by changing the Property we use to get the language code. Now you will see the spelling variant input pop up less often. (phab:T349652)
We started work towards allowing many more languages by default for Wikibase Lexeme and monolingual text statements. This will remove the need for a lot of requests for new language codes to be added. (phab:T341409)
Query Service UI: We’ve fixed a small issue in the Query Service UI that was introduced when updating CSS variables. Content of selected cells in the query result should be readable again. (phab:T350153)
EntitySchemas: We are continuing the work on the new data type for linking to EntitySchemas, working on overcoming architectural issues.
South Africa report: Edit-a-thon for Librarians at the annual Library and Information Association of South Africa 2023 Conference
Sweden report: Wikipedia for all of Sweden; Museums and Wikidata – why and how?; Photo memories from Stockholm and Rome; Negotiating Knowledge on Wikipedia
Here's your quick overview of what has been happening around Wikidata over the last week.
Discussions
Open request for adminship: S8321414 (RfP scheduled to end after 13 November 2023 14:50 UTC)
New requests for permissions/Bot:
DiFoolBot 2 (Task: import VIAF ID based on Union List of Artist Names ID)
KizuleBot (Task: Adding sitelinks to Serbian Wikipedia for maintenance categories (see contributions of account) based on items for English Wikipedia's ones)
KormiSKbot (Task: Linking newly created pages on SKWiki to the appropriate Wikidata items.)
Linked Data for Libraries LD4 Wikidata Affinity Group call November 14, 2023: We will hear from Diego Saez-Trumper on Wikidata Revert Risk and Annotool. Agenda
Latin America in Wikidata Challenge (Spanish) < Nov. 14 - Dec. 14. Play the game and help highlight the region hosting the GLAM Wiki conference! (Prizes available).
Edit-a-thon "Women in sciences" < 17 Nov. 12:00 - 18 Nov. 03:00 (UTC). A live edit-a-thon to improve the quality and variety of articles of women who have won the prestigious L'Oréal-UNESCO National Award "For Women in Science". (Prizes available).
Language community meetings: A new initiative by WMF language team to organize quarterly gatherings to encourage collaboration among individuals and communities interested in language-related technical topics. First meeting: Friday, November 17, 2023, 16:00 to 17:00 (UTC)
Glam Wiki Program - 16 - 18 November, Montevideo, Uruguay. Want to attend in person? - Register here. Wikidata Related Sessions:
Knowledge Graphs - Foundations and Applications course by Prof. Dr. Harald Sack. October 11, 2023 - November 21, 2023. Enrol here if you have not done so yet.
GSOC ‘23: Automating Area Management in MusicBrainz <-- Prathamesh, in their Google Summer of Code project, developed a data pipeline to automate the synchronization of area metadata between MusicBrainz and Wikidata.
Papers
Wikidata for authority control: sharing museum knowledge with the world <-- The project “Usable Authorities for Data-driven Cultural Heritage Research” aims to link museum authority data to Wikidata, enhancing the visibility, accessibility, and relevance of information across different museum collections, and encouraging cultural heritage institutions in Sweden to contribute to and utilize the Wikimedia platforms.
The 4th Wikidata Workshop for the scientific Wikidata community happened on 07 November 2023. You can find a full list of presented papers here
Videos
Connecting Entomological Collectors ECN2023 <-- A presentation for the Entomological Collections Network 2023 conference by Siobhan Leachman, this presentation explains how Wikidata can be used to create an identifier for entomological collectors, empowering the collation & linking of biographical data as well as the ability to link to other databases & catalogs relating to those collectors.
Wikidata projects in Wikimedia Spain <-- part of the online sessions organized by Wikimedia España to celebrate the 10th anniversary of Wikidata. In this session, Ángel Obregón, a member of Wikimedia España, presents some of the projects driven by Wikidata by the members of WMES.
Notebooks:
Subclass of... <-- Wikidata's ontology is complex. This tool aims at finding if an item is a subclass of another one.
The Wednesday Index <-- A longitudinal analysis of gender diversity in Wikipedia articles.
Tool of the week
OpenFlights.org is now getting some of its airline data from Wikidata. It is a free open-source tool that allows you to log, map, calculate, and share your flights and trips.
WikiProject IDEA - This Wikiproject serves as a working group and process documentation archive for the International (Digital) Dura-Europos Archive (IDEA), a project generously funded by the National Endowment for the Humanities (NEH) and in development at Bard College and Yale University.
WikiProject Interwiki - The goal of this project is linking Wikidata items with wiki articles outside of the Wikimedia ecosystem. Niche wikis frequently offer more in-depth content compared to general online dictionary articles, although their quality can vary significantly.
WikiProject SrpKor - The main aim of this project is building Wikidata entities for the novels from SrpKor: Corpus of the contemporary Serbian language.
WikiProject Echinodermata - A repository for information pertaining to the NSF Grant Echinoderm Project.
Development
Wikibase REST API: We continued working on the new endpoints for POST /entities/items/{item_id}/aliases/{language_code} (phab:T335842) and DELETE /entities/items/{item_id}/descriptions/{lang_code} (phab:T342986)
EntitySchemas: We are continuing the work on the new data type for linking to EntitySchemas, working on overcoming architectural issues.
Language codes: We continued work towards allowing many more languages by default for Wikibase Lexeme and monolingual text statements. This will remove the need for a lot of requests for new language codes to be added. (phab:T341409)
Canary (also known as heartbeat) events will be produced into Wikimedia event streams from December 11. Streams users are advised to filter out these events, by discarding all events where meta.domain == "canary". Updates to Pywikibot or wikimedia-streams will discard these events by default.[45]
Data Modelling Days, from November 30th to December 2nd: 3 days of online events to address data modelling challenges, discuss how to improve the way we structure data together, and discover the point of view of external reusers. Feel free to have a look at the program (under construction) and to sign up as a participant.
Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour November 20th, 2023: Over the summer and into the fall the LD4 Wikidata Affinity Group will be offering a series of Wikidata Working Hours to give folks an opportunity to try out various Wikidata-related skills and tools by assembling a data set of diverse library and information science (LIS) materials (articles, conference proceedings, books) and adding it to Wikidata. Wikidata Working Hours provide hands-on Wikidata experience in a supportive space. We hope you will join us if you are interested in learning more about Wikidata, exploring LIS literature, and have been looking for a fun Wikidata project to contribute to. The seventh Wikidata Working Hour will cover the Author Disambiguator tool, which helps users assign authors to articles.During the session we will demonstrate how to use the tool on an author who was created during a previous working hour, and another who doesn't exist in Wikidata yet. After the demonstration, participants are encouraged to try the tool themselves during the rest of the working hour. This session will build on the work done in previous Working Hours by connecting authors to the articles they have written. This session will be recorded and the recording shared on the event page
ItWikiCon '23 (Italian) was hosted in Bari, Italy between the 17th - 19th November. Check the Programme for details on sessions and check for recordings or slidedecks of presentations.
GLAM Wiki 2023 took place in Montevideo, Uruguay. There were several Wikidata-related sessions some of which are linked in the Videos section.
Can you trust Wikidata? - is a paper exploring Wikidata's veracity and trustability for providing values to Knowledge Graphs. Written by V. Santos et al.
User-level gender statistics for Wikipedia - a tool that computes the number of articles created by gender has been repaired after some months of unavailability. It relies on xtools and P21 property.
Luthor - tool for finding usage examples from Wikisource and adding them to lexemes on Wikidata.
WikiProject Manuscripts - This WikiProject coordinates efforts on Wikidata to gather and curate structured data on manuscripts.
WikiProject Grove Hall Black Women Lead - aims to shed light on the lives and stories of Black women leaders who have shaped Boston’s history from the colonial era to the present day.
Newest database reports: User:Pasleim/projectmerge/enwiki-svwiki - 3875 merge candidates in English Wikipedia and Swedish Wikipedia based on same sitelink name.
Next Linked Data for Libraries LD4 Wikidata Affinity Group call November 28, 2023: As a satellite event for Data Modeling Days, we will facilitate community discussion around data modeling in Wikidata for library collections in a variety of formats, including people, books, serials, scholarly articles, rare materials, music, media, and realia. Agenda
Linked Open Data and Wikidata < Alan Ang (WMDE) talks about the importance of Linked Open Data and forming mutually beneficial partnerships between the Foundation and Institutions.
Notebooks
Explore new ways of visualising your data with a Circular Dendrogam, illustrated here with Association Football players broken down by Country and Team.
Tool of the week
Harvest Templates - is a tool that helps transfer data from Wikimedia projects to Wikidata.
User:MichaelSchoenitzer/Updown - is a userscript used for faster navigation. If there are a lot of values for one property it will add arrows that allow you to jump to the first/last value.
Other Noteworthy Stuff
WMDE is researching ways to improve the editing experience in different languages and would love to hear your feedback. We would like to talk to a few of you in online interviews to learn about your experiences, expectations, and concerns. Please let us know in this sign-up form if you are interested in taking part.
The Research team at WMF is running a second labeling campaign to evaluate the Revert Risk model for Wikidata. This is part of ongoing work on creating a new generation of Machine Learning models to support patrolling work on Wikimedia projects. Please help by going to this link, and labeling each revision in one of these three categories: Keep, Not Sure, Revert. Notice that "Not Sure" should be used in all cases where the Keep or Revert labels are not clear to you.
COR SEM (The Danish central word registry identifier)
UPOC ({{TranslateThis
| de = Ist eine eindeutige id zur Identifizierung von Organisationen/Sendungen/... zu einem Dienstleister.
<!-- | xx = Beschreibungen in anderen Sprachen -->
}})
WikiProject Heritage Collections - The aim of the present project is to create the world’s most comprehensive high quality database of archival fonds and heritage collections (including contemporary scientific collections or documentation holdings) and to ensure the interlinking of respective catalogues, finding aids, or collection databases with Wikidata.
WikiProject Events and Role Frames - The primary aims of WikiProject Events and Role Frames is to define a set of properties that consistently model event occurrences and their participants; to fill gaps in Wikidata regarding items for events and actions; and to encourage use of the proposed model and newly introduced items across Wikidata.
We continued to make a lot more language codes available (phab:341409)
EntitySchemas: We are experimenting with how to work around some technical blockers for the new datatype
Wikibase REST API: We've been working on the ability to remove an Item's label in a specific language and modify the descriptions on a Property (phab:T342981, phab:T342981)
Wikifunctions & Abstract Wikipedia Newsletter #134 is out: Welcome, Grace and Miguel! Appointing Functioneers now by community
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we present two new members of the team, we discuss the latest changes in software, and we announce that Functioneer right will be now assigned by the community.
Want to catch up with the previous updates? Check our archive!
This first meeting language will be English, but we plan to host conversations in other languages, and about other topics. Please visit the conversation page on-wiki for the details on how to join. You can also watch the page, or suggest ideas for upcoming conversations there.
Impact Module
At the beginning of November 2023, the Growth team deployed the New Impact Module to all Wikipedias. We recently released a follow up improvement to how edit data was displayed based on editor feedback. [55]
Developers can find some initial proof of concept code shared on gitlab.
Mentorship
When a mentor marked themselves as "Away", they were not getting their name assigned to new accounts when they returned. This has been fixed. [59]
We improved the message received by newcomers when their mentor quits, to reduce confusion. [60]
We worked on ensuring that all mentees are assigned to an active mentor. This required reassigning mentees with no mentors to a new mentor. We paused this as the clean-up script confused some editors. We will resume it when the identified blockers are resolved. [61]
It is now possible to create an Abuse Filter to prevent one user from signing up as a mentor. [62]
Around 250 war-threatened architectural monuments documented (German) - Wikidata, Wikibase and Commons are helping preserve and plan the restoration of culturally-significant Monuments damaged or destroyed by the Russian invasion of Ukraine.
ZotWb < export records in a Zotero group library to a custom Wikibase, prepare datasets to send to OpenRefine, feed OpenRefine reconciliaton results back to the Wikibase. Wikidata is envolved in the entity reconciliation. Here's a short explanation and demo video Tool is written and provided by David Lindermann with support from WMF Rapid Grant.
Montana Plant Life URL (URL for a plant family, genus, or species on the Montana Plant Life website)
event role (item that describes a role in an event class)
role in event (event class for which the item describes a role)
selectional preference ((to be used only with the subclasses of Q_event_role) an item that plays this role in an event instance should descend from this item via a combination of P31 and P279)
event arguments and types (item that plays a role in an event instance; used with a qualifier "argument type")
BnF archives and manuscripts ID (identifier for a manuscript in the archives and manuscripts catalogue of the Bibliothèque nationale de France (BnF). Do not include the initial "cc")
clerked for (this person has held a clerkship with the judge)
battery life (the length of time a device can continue to work before it needs its battery to be recharged)
Showcase Lexemes: läsa - 'read' about this Swedish word with many pronounciations and grammatical features.
Feel free to suggest next week's Showcase Item and Lexeme!
Development
Wikibase REST API:
We finished the endpoint for removing an Item's label in a specific language (phab:T335841) and the endpoint for modifying descriptions on a Property (phab:T342981)
We are working on the endpoint for adding aliases in a given language for a Property (phab:T343721) and removing a Property's label in a given language (phab:T342983)
Mismatch Finder: We are continuing the work on moving the tool over to the new design system Codex
We adjusted the styling for the values of monolingual text statements to make the language easier to distinguish from the value (phab:T280774)
mul language code: We made some final adjustments to get it ready for testing.
Lexemes: We are adding a license note for anon users when editing a Lexeme’s lemma, a Form or Sense (phab:T343999)
Wikifunctions & Abstract Wikipedia Newsletter #135 is out: Announcing Wikifunctions on the Wikimedia Foundation blog. Looking for feedback on the Function page proposal
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we present the latest blogpost about us on WMF blog, and we invite you to share your feedback at the Wikifunctions' Project chat on the new proposed designs for the Function page.
Want to catch up with the previous updates? Check our archive!
Portugal report: Catalan culture and showcasing Wikimedia on both side of the Atlantic
Serbia report: Wikipedians in Residence, GLAM Wiki Conference
Sweden report: National Historical Museums of Sweden contributions; Photo memories from all over the world engage the community; Museum of medieval photo safari
Here's your quick overview of what has been happening around Wikidata over the last week.
Discussions
New requests for permissions/Bot: LccnBot (Task: Adds P244 to bibliographic entities base on library authority records.)
New request for comments: Duplicate References Data Model and UI < During Data Modelling Days '23, 2 proposals emerged trying to answer the question of how to handle duplicate References on Wikidata Items.
Next Linked Data for Libraries LD4 Wikidata Affinity Group call December 12, 2023: Several members of the Chinese Culture and Heritage Wikidata group will provide an overview of the group's Wikidata projects as well as the challenges they have encountered. Agenda
Data-SHS Bordeaux Week: Processing and Analyzing Quantitative Data in Human and Social Sciences 2023. Dec. 11 - 15, Bordeaux, FR.
OpenRefine - a open source tool for working with data < This session explores the advantages of using OR to wrangle, clean, transform and standardise data for Wikidata. Presented by Jinoy Tom Jacob at the IndiaFOSS3.0 Conference.
QLever SPARQl Engine < If you attended Data Modeling Days '23, you may have seen an extraordinary Session given by Hannah Bast and Johannes Kalmbach showcasing the power and advantages of the QLever engine. QLever can handle queries that cause the WDQS to timeout or allowing Federated queries and Geospatial!
(QLEver has already featured in Tool of the Week but we wanted to showcase it again after experiencing it at DMD '23)
counterexample (qualifier for deprecated P279 statements; example instance or subclass of the item class for which a "subclass of" statement does not hold)
WikiProject Heritage Collections: database of archival fonds and heritage collections (including contemporary scientific collections or documentation holdings) and to ensure the interlinking of respective catalogues, finding aids, or collection databases with Wikidata.
WikiProject Source Reliability: is an effort to identify and aggregate online sources of assessments of the reliability and credibility of sources.
Wikibase REST API: We continued work on the routes for adding aliases in a given language for a Property (phab:T343721) and removing a Property's label in a given language (phab:T342983)
Monolingual text values can now use many more languages than before. We’re still working on doing the same for Lexemes. (phab:T341409)
Wikifunctions & Abstract Wikipedia Newsletter #136 is out: Looking back at 2023
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we look back at all the accomplishments that we achieved in 2023 and we thank you all for your help in achieving them. Also, we take a look at the latest software developments.
Want to catch up with the previous updates? Check our archive!
This will be the last newsletter for 2023, so see you again in 2024!
Other discussions: How to handle concepts of trans people on Wikidata? Should {privacy at wikidata.org} be redirected to {privacy at wikimedia.org} or should it be monitored by Wikidata volunteers? Join the discussion!
Upcoming: Next Linked Data for Libraries LD4 Wikidata Affinity Group Working Hour December 18th, 2023: Over the summer and into the fall the LD4 Wikidata Affinity Group will be offering a series of Wikidata Working Hours to give folks an opportunity to try out various Wikidata-related skills and tools by assembling a data set of diverse library and information science (LIS) materials (articles, conference proceedings, books) and adding it to Wikidata. Wikidata Working Hours provide hands-on Wikidata experience in a supportive space. We hope you will join us if you are interested in learning more about Wikidata, exploring LIS literature, and have been looking for a fun Wikidata project to contribute to.The ninth and final Wikidata Working Hour in the series will be using SPARQL and Scholia to query and visualize the data we’ve added to Wikidata during our series. This session will be recorded and the recording shared on the event page
Blogs: #LD42023. Part I: The Future of Wikidata + Libraries (A Workshop) - This blog series explores how libraries engage with Wikidata and Linked Data in the face of AI challenges. Led by Silvia Gutiérrez and Giovanna Fontenelle from the Wikimedia Foundation, the series summarizes insights from a collaborative session at the 2023 LD4 Conference, using Design Thinking strategies to connect the Library-Wikidata community with WMF, focusing on Wikidata, Wikibase, and Structured Data on Commons (SDC) in libraries. By Silvia Gutiérrez & Giovanna Fontenelle
Papers
Wikipedia gender gap: a scoping review - This review analyzes Wikipedia's gender gap from 2007 to 2022, revealing a slight majority of female authors, addressing key themes, and exploring strategies to mitigate the gap, providing valuable insights into the research landscape in this domain. By Núria Ferran-Ferrer, Juan-José Boté-Vericad and Julia Minguillón.
Ten years of Wikidata: A bibliometric study - This research delves into scholarly publications about Wikidata from its inception in 2012 to late 2022, revealing 945 relevant papers, primarily from conferences. The analysis highlights a concentration of experts and contributors from the Global North, as well as governmental institutions as predominant funders. The study calls for enhanced networking and outreach to promote diversity and inclusion within the Wikidata research community. Emphasizing computer science perspectives, the research focuses on methods for developing and utilizing open knowledge graphs, notably Wikidata, with a narrower but significant interest in application-oriented studies in digital humanities, biology, and healthcare. (Turki, et al)
Videos
Duplicating Everywhere All at Once | Cebuano Wikipedia - Five years ago, Lsjbot's Wikipedia articles caused duplicate Wikidata items, notably impacting geographic places on Cebuano Wikipedia. This video by User:Canley at Wikimania 2023 delves into the history, visualizes the issue, and suggests cleanup strategies for Wikidata and Wikipedia, emphasizing Aotearoa New Zealand and parts of Australia, with implications for the global challenge of bot-created duplicates.
Useful Authorities for Data-Driven Collection Research with Alicia Fagerving - Alicia Fagerving, Wikimedia Sverige, introduces the project "Useful Authorities for Data-Driven Collection Research" and Wikidata. The project, spanning 2021-2023, links vocabularies from the databases of Nationalmuseum and Statens historiska museer to Wikidata, exploring it as a platform for semantic interoperability among cultural heritage institutions and providing tools and visualizations for similar projects.
2023: OSM-Wikidata Map Framework. Combining OpenStreetMap and Wikidata allows to leverage the strengths of the two projects to create richer maps. This talk explores how OSM-Wikidata Map Framework simplifies this process. By Daniele Santini
It's not bad! Measuring Gérard Depardieu's mark on French cinema (in French) - The analysis centers on Gérard Depardieu's impact on French cinema amid legal issues and sexual assault allegations. Despite difficulties in addressing these accusations, the author leverages Wikidata to measure Depardieu's influence by querying films from directors born after 1930 to assess his involvement.
How to Become a Billionaire: A Billionaire's Occupations Network Analysis - This network analysis investigates billionaires’ primary sources of income with a network graph—based on their occupations—connecting billionaires from all over the world and uncovering some of the biggest industries in the world.
Drama Corpora Project (DraCor) is a digital database of plays, primarily from Europe. It collects and organizes texts of plays in a way that allows researchers and others to extract and analyze information from those texts. This could include details about the characters, the dialogue, the stage directions, and more. The data is being pulled from Wikidata.
We finished adding the endpoints for adding aliases in a given language for a Property (phab:T343721) and removing a Property's label in a given language (phab:T342983)
We started working on the endpoint for removing a Property's description in a given language (phab:T342985)
We are fixing an issue with incorrect handling of lowercase statement IDs in edit requests (phab:T352644)
Special:PrefixIndex now shows label/lemma for Properties and Lexemes (phab:T343115)
Language codes: We changed where Wikidata is getting its languages from for Lexemes and Monolingual text statements and thereby resolved many tasks requesting another language being added to them (phab:T341409)
Here's your quick overview of what has been happening around Wikidata over the last week.
Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here: What changes would you like to see in the newsletter in 2024?"
Discussions
Open request for adminship: EPIC (RfP scheduled to end after 26 December 2023 20:34 UTC)
New requests for permissions/Bot: Balyozbot. Tasks:
Import sitelinks, labels, descriptions from ku wikipedia pages which use the template w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
Upcoming: Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
Papers: Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs by (Conia et al, 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
Videos
Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi, about Wikidata for African Librarians during the Wiki Indaba conference, that took place between 3-5 November 2023 in Agadir, Morocco.
No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well enable writing information back to source and federating with other linked institutions.
Map of K-Pop Idols --> An interactive map where each red dot represents a K-pop Idol (a singer or musician in South Korean Pop music) you are able to click on.
Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
Tool of the week
Cersei - is a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on eg JavaScript to navigate. It can therefore access data sources that can not be accessed via eg Mix'n'match. The data from sources can be updated regularly, either for everything, or just changed entries (if the source has a "recent changes" equivalent).
Wikidata:Zotero/Cita - is a Wikidata addon for Zotero that adds citations (i.e., what other items an item cites) metadata support to this open source reference management software, using cites work (P2860) information available from Wikidata, and enabling users to easily contribute missing data.
production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
Newest WikiProjects: WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
Upcoming: The next Wikidata+Wikibase office hours will take place on Wednesday, 17:00 UTC, 17th January 2023 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Papers: Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
Videos: WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video Lydia Pintscher takes a moment to review the major events of Wikidata over the past few years. Then turns to look forward and predict what Wikidata's prospects will be over the next year.
Tool of the week
WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
Nonprofit Status (Indicating the legal and tax status of a non-profit organization (specific to served legal areas, aka. Countries). Addition to {{P|1454}}. {{P|1628}} to [https://schema.org/nonprofitStatus nonprofitStatus] from schema.org. Organizations can have multiple Nonprofit Status from different countries.)
creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Blogs: PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
LIS Journals’ Lack of Participation in Wikidata Item Creation by Eric Willey & Susan Radovsky, discusses the gap of Wikidata items being created for scholarly articles by the scholar's themselves and if this can lead to inconsistent or inaccurate data model.
Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
Videos
Map Kerala Initiative is an opendata portal geospatial map powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
Notebooks: Wikipedia article as a timeline - This tool transforms a Wikipedia article in a timeline by parsing all internal links in a Wikipedia article and retrieving the date corresponding to each internal link using the point in time (P585) property in Wikidata.
Tool of the week Map your list of created articles - a notebook display of geolocated articles on a map created by a user per chosen project and batch (featured/good article).
Other Noteworthy Stuff
Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project Software Collaboration for Wikidata prematurely. Read their joint statement here.
IP masking/temporary accounts: We are adjusting Wikibase to be prepared for the upcoming changes to no longer expose IP addresses for non-logged-in users (phab:T351968)
Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (phab:T305660)
Wikibase REST API:
We finished the work on making it possible to get all sitelinks of an Item (phab:T344041)
We are working on getting a sitelink for a given wiki (phab:T344039)
Wikifunctions & Abstract Wikipedia Newsletter #138 is out: The Joy of Collaboration; Introducing the Function of the Week: reverse string
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we present you an essay from Denny and we introduce a new section of the newsletter, dedicated to the "function of the week". Also, we take a look at the latest software developments.
Want to catch up with the previous updates? Check our archive!
We also wanted to note that we are in the process of moving the Updates from Meta to Wikifunctions. We'll keep you updated about it.
Poland report: Intense end to a year of GLAM-Wiki activities in Poland
Sweden report: Photo memories project concludes; Sörmlands museum passes 1000 uploads to Wikimedia Commons; Wikimedian in Residence supports an upload of music content; Subject terms from Queerlit; Wikidata for authority control: 3 years of work
USA report: WikiConference North America 2023; TSU and USF; Philadelphia WikiSalon; Wikimedia DC Annual Membership Meeting; Wikipedia Editing 101 for All; NYC Hacking Night; Upstate NY workshop; Wikiquote She Said Project
Here's your quick overview of what has been happening around Wikidata over the last week.
Discussions
New request for comments: Community request for the development team to access inverse properties on client wikis. (Summary: We currently cannot access inverse property values on Wikipedia. This can be a data management issue on Wikipedia as we must always ask ourself if we must introduce an inverse property for cases where we need them. So I think it’s useful to gather the usecases community would want and draft a request for an API to the devteam to do that.)
Upcoming: The next Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Past
Provenance Loves Wiki (PLW24), Jan 12th - 14th, research and data on the origin of artworks and cultural heritage and how Wikibase and Wikidata can support this.
WikiLovesWomen #SheSaid campaign wrapped up the 2023 campaign by visiting Kinshasha and Kisangani, where local Wikimedians improved quotes from women on FR Wikipedia and Wikidata.
QLever: a new way to query OpenStreetMap --> Discussion of the new opportunities offered by QLever to query OpenStreetMap and to run federated queries with Wikidata
Wikidata for authority control: 3 years of work --> The three-year Wikidata for authority control project, a collaboration between Wikimedia Sverige and Swedish museums, concluded in December 2023. It equipped museum staff with tools and skills to integrate their authority databases with Wikidata, resulting in added identifiers, SPARQL query proficiency, and enhanced knowledge sharing within the GLAM sector.
Go-ahead for Wikidata Project of GLAM institutions from Baden-Württemberg --> The GLAM-BW project, under "GLAM goes OpenData," connects major collections in Baden-Württemberg, focusing on the württembergische Kunstkammer. With over 3,000 objects, the project integrates information on collectors, histories, and objects into a knowledge graph for semantic searches, contributing to the broader realm of linked open data, akin to Wikidata.
Swiss GLAM Programme --> Wikimedia CH imported the Museum of Natural History of Neuchâtel's urchin fossil casts to Wikimedia Commons, connecting structured data on Wikidata. The project involved data cleaning, adding missing elements, and file imports via OpenRefine, highlighting seamless integration between Wikidata and Commons.
Papers
Reflections on the PCC Wikidata Pilot at UCLA Library: --> Undertaking the PCC Learning Objectives. Discusses the 14-month Pilot programme for cooperative cataloguing of UCLA Library and Museum Collections. By E. Zhang, P. Biswas & I. Dagher.
SMWCon 2023: Semantics, Wikis, and AI --> Day 1, Keynote by Prof. Markus Krötzsch who explores origins and principles of semantic wikis and key challenges that lie ahead in managing knowledge.
Brian M Sperlongano released US boundary QA checker, a quality assurance tool for finding issues with boundary data in the United States by using Wikidata, OpenStreetMap, and US Census Bureau data.
The Surrounding Ocean (available at vrandezo.github.io/TheSurroundingOcean) - is a tool that allows you to browse lexicographical data. You can use the tool to explore words and their meanings, translations, and synonyms. The tool is currently under development, and the developer, Danny, would appreciate feedback to fix any issues with the tool. More info: Wikidata:The Surrounding Ocean.
WikiProject Highlights: Ontology Cleaning Task Force: A group of people have started a task force to discuss problems with the Wikidata ontology and how to clean them up. Anyone interested in participating is welcome. The task force maintains Wikidata:WikiProject Ontology/Cleaning Task Force as a record of its activities. You can add yourself to the participants list there and find out how to join group meetings or otherwise participate in the group. (Got something noteworthy happening in your WikiProject? Share it in the upcoming issue!)
IP masking: We are working on adjusting Wikibase to handle the upcoming introduction of IP masking, which will give editors who are not logged in a temporary account name instead of using their IP to attribute edits to (phab:T351968)
Lexicographical data: We are changing how empty Senses and Forms are represented in the dumps (phab:T305660)
mul language code: We are doing user testing for the current implementation to see if it is understandable for people.
Mismatch Finder: We are continuing the work on migrating it to the Codex design system.
REST API:
We improved the handling of lower-case statement IDs (phab:T354262)
We are working on getting a sitelink for a given wiki (phab:T344039)
Wikifunctions & Abstract Wikipedia Newsletter #139 is out: Refreshing the Function page; Function of the Week: ROT13
There is a new update for Abstract Wikipedia and Wikifunctions. Please, come and read it!
In this issue, we announce a redesign and rewrite of the Function page and we present our first "function of the week". Also, we take a look at the latest software developments.
Want to catch up with the previous updates? Check our archive!
Here's your quick overview of what has been happening around Wikidata over the last week. Translations are available.
Discussions
New request for comments: Domain name as data (Summary: How should Wikidata store the domain name associated with an item? There are many properties for URLs, but a domain name is a different value.)
PLW 2024: Provenance loves Wiki - Fri. 12th - Sun. 14th January. If you missed the event, catch up by reading the slides, Notes and watching the recordings on the Project page
Next: Linked Open Data in Heritage Workshop > Jan. 23rd, 13:00 - 15:00 CET. If you are in the Maastricht University Faculty and want to know enhance heritage research, improve data management, connectivity and visualisation, register for the Workshop.
AskWikidata: Natural language queries to Wikidata, a naive prototype created by Senior Software Engineer for Wikidata, Robert Timm. Want to try? (Google Colab)
IP Masking: We are continuing to adapt Wikibase to the upcoming IP Masking feature. We worked on hiding warnings about IP addresses being saved when they don’t apply (phab:T353807, phab:T352006) and creating temporary accounts when editing (phab:T354730)
Wikibase REST API:
We continued working on the ability to get a sitelink for a given site (phab:T344039)
We started working on the ability to remove a sitelink for a given wiki (phab:T344685)
We worked on fixing a bug where the REST API PUT request does not handle statement on Items with lowercase statement IDs (phab:T352644)
mul language code: We did user testing to find any remaining issue before release