Wikipedia talk:WikiProject Fact and Reference Check/Archive 1
There were many ideas in the Wikipedia channel about this topic, I saveed the log and will extract the basis of the ideas here later (and will attribute them as well). --ShaunMacPherson 14:01, 1 Sep 2004 (UTC)
Implications for WikipediaThere have been a few comments that people are worried that this project may interrupt how Wikipedia currently operates. I don't think that fact checking need necessarily interrupt Wikipedia, what it will seem to do is is open up a new avenue of opportunity to people who are not verbal or literary, but have excellent research and fact finding skills. Those that just want to write can do so, and those who want to verify that the autoreferencing code is working properly, and that new facts are numbered and checked have their part to play as well. (Nothing precludes people from doing both too :) ). --ShaunMacPherson 17:03, 1 Sep 2004 (UTC) ExamplesI'll be working on forming a more concrete example by actually formatting a current Wikipedia article, and making its reference page. Of course I'll have to do it manually since the autoreferencing code has not yet been developed, but hopefully it will be a better preview then the Jack example, on the main page, as to how such fact and referencing might be implemented. --ShaunMacPherson 17:03, 1 Sep 2004 (UTC) ToolsSome ideas:
-Sewing - talk 18:26, 1 Sep 2004 (UTC) I like the idea, but since people are constantly adding to articles, we would have to tag checked articles to show when they were last checked. Danny 18:29, 1 Sep 2004 (UTC)
Here's the feature request for footnotes on MediaZilla: http://bugzilla.wikipedia.org/show_bug.cgi?id=192 - There's also some discussion on m:Footnotes. --Kurt Jansson 22:12, 1 Sep 2004 (UTC)
More tools.. --Quinobi 07:04, 15 Oct 2004 (UTC) Example?Instead of the trivial hypothetical "blue pants" example, could someone take on a real, non-trivial article as an example of how this might work in practice? I suggest starting with something that is reasonably stable, not terribly controversial, and not previously chock full of references. Maybe mayonnaise? -- Jmabel 19:36, Sep 1, 2004 (UTC)
Potential problems and complexitiesI can see a few possible difficulties with fact-checking in this fashion. Many of the problems mentioned on m:Footnotes apply. Assuming every fact is to be referenced in some way, I think it will be important to have the capability to tag a large block of text with a single reference; for instance, the very short stub (found by random-page browsing) Stubbekøbing contains this text:
Even this short stub contains many facts:
Referencing each fact separately, especially for long articles, would be a monumental task. Delimiting which fact(s) are being referenced seems like the only reliable way to do it. However, whenever a new fact is inserted inside a delimited chunk, new delimiters must be specified and/or a new reference added (increasing the complexity of our already complex wikicode markup). This leads into another problem - tracking changes in the article, and ensuring that references are not wrongly associated with unrelated facts (or vice versa). Aside from re-checking articles after each addition or alteration, I can't think of an obvious way to prevent this from causing problems. One possible way to eliminate the need for most references is to use references only for facts that may be unintuitive, not widely known, under dispute, or otherwise contentious. Of course, this opens a whole new can of POV worms: how to decide when a fact is incontrovertible enough to not require a reference. Any facts with a URL as a reference may quickly become outdated, especially if the reference is a news article that may not be archived for more than a few weeks. Obviously, additional information such as author, publication media and date would be strongly encouraged. I like the concepts discussed on m:Footnotes, and think they would go a long way to solving the lingering question of accountability, aside from the uglification of wikicode caused by them. A simple list of references alone would give us more credibility in many cases. I can think of no publication that references every significant fact, so we would likely be one of the first to attempt it. Some brainstorms:
Overall this seems like a very worthy endeavour. I think it could work if we give it enough thought. -- Wapcaplet 01:13, 2 Sep 2004 (UTC)
I am uneasy about the practicability of this idea. To work we would need to define the form the references are to be given, and ensure everyone sticks to them, eg Joe Bloggs, Clog Dancing for Dummies, Complexity Press, (2004), or Bloggs, Joe ---- . There is no consistency about how references are already written. If references are footnoted, means must be made to automatically update the numbering if new edits and references are made. In the New Oxford Dictionary of National Biography, to which I am a contributor, footnotes are not given, but a block of references is given at the end, including printed sources, portraits, sound recordings, motion pictures etc, and I suggest we adopt the same principle. To fact check and reference every statement in a paper is a big addition to the work of editors, and could well impact on the length of articles submitted due to the amount of extra work needed to do it comprehensively. Apwoolrich 06:31, 2 Sep 2004 (UTC) Some of the facts to be referenced in the spider article are actually matters of definition. Are we going to give citations for statements like, "Humans have two legs"? Sometimes we will need citations for shady areas, e.g., the idea of "races" of various non-human organisms in biology was subjected to a long debate and part of the reason for that debate was that not all participants realized that different specialties in biology habitually use the term in different ways, avoid its use, etc. The places where references are likely to be most useful are matters of objective fact that are not well known, e.g., that spider "silk" is a protein structure. People who find that matter interesting would benefit from references that give more information than would be appropriate in a general article on spiders. But all of the information that any well informed student of the subject will agree with, the number of legs, the lack of wings, etc., etc. is found in compact form in standard reference books such as B. J. Caston's How to Know the Spiders. P0M 20:26, 21 Sep 2004 (UTC) A proposed strategyVery little effective fact checking will be achieved on our massive body of information merely by arbitrarily starting somewhere, breaking things down to atomic facts, and checking them. As in software quality assurance, we need to spend our resources in such a manner as to get the most "bang for the buck". I would suggest that once we've decided what we will consider as acceptable verification (or falsification) of a factual statement (and that will not be a simple matter: sources can contradict each other), there are several reasonable strategies to choose appropriate facts to check. The key principles here are (1) nothing should be immune from fact-checking but (2) we should try to direct a disproportionately large amount of resources to areas we think are most likely to contain falsehoods. Here's what I'd propose:
That leaves 30% of resources to be allocated more arbitrarily, e.g. by people just picking something they are interested in checking and going for it. Obviously none of this is set in stone, and the percentages might change with time or other categories might be added: e.g. if we are trying to turn part of Wikipedia into a published book or CD-ROM, that would merit getting priority. -- Jmabel 06:39, Sep 2, 2004 (UTC)
As I point out above, there are too many separate factors here for a guideline as simple as 80-20. -- Jmabel 21:43, Sep 13, 2004 (UTC)
Related discussionThere's a related discussion going on here. You can see the entire thread by clicking on the subject line.--Eloquence* I noticed that many people are getting very interested, and eager, to make Wikipedia a more authoritative source of information. I also have been on some of the mailing lists and there has been discussion about refereeing articles like the old Nupedia. If you see any discussion or comments about referencing, fact checking, foot notes, please leave a brief comment about this project. Hopefully it will be a nexus for developers and wikipedians to discuss how to make wikipedia a better and more authorative source of information. --ShaunMacPherson 13:57, 21 Sep 2004 (UTC) Generated Facts?What about for fact that are not in the format needed. For example, say you have the date of birth of Napolean, and the year that he was exiled, and from those you get his age. Could the fact checking mechanism explain the math used to get the new fact? Jrincayc 01:32, 22 Sep 2004 (UTC)
How about this as a way forward?I've been thinking about this problem with my back brain, and I've come up with this suggestion using existing features of MediaWiki.
This namespace could be used for all sorts of primary material, e.g books, webpages, articles, magazines. I would imagine there would be different templates for each type of primary material. I think it quite likely that academics would be very interested in such a primary source namespace. Adding a Reference or Source namespace to Wikipedia would only require requesting it from the developers. Possessing a seperate namespace for articles and referencing of primary source material has a number of advantages. It is far more sustainable, because the references will properly checked as a whole against the primary material. And providing the primary material is rigourously checked, it would be fairly trivial to check a wikipedia article for correct referencing to the articles in the Reference namespace. Clearly such a namespace with appropriate referencing would strengthen the academic credentials of Wikipedia. Obviously the linking mechanism is a bit ugly and obtrusive still; but would be an easy change to the code. It would also be very useful if the wiki-ref links allowed internal Wikimedia links so that What links here would tell us which articles are referencing the facts, which would obviously be vital to check and update references. In the future there would be a number of ways developers could create a list of footnotes within a Wikipedia article. The data being organised into sections could be manipulated in a number of ways or even moved to a proper database. This would mean that people could work on this project without fear of wasting their time. :ChrisG 21:28, 4 Oct 2004 (UTC)
I can see the advantage of having multiple references for one fact collected together. But I think that it is working back to front. Trying to keep multiple references to a single fact correct would be a nightmare to do as it would be dissassociated form the primary sources. A fact relates to a specific primary source. It makes more sense to collect all the facts relating to a primary source attached to the article about the primary source, because it would combine the process of reviewing a primary source and noting down useful facts in one process. I'm sure somewhere on Wikipedia some other people were interested in starting a WikiReview project and so they would be interested. It would also mean that if a person reviewed a primary source properly, it would be suitable for use in multiple article, maximising the value of doing it. People looking for facts in articles would also have a seperate namespace they could search to look for suitable facts. Obviously for controversial articles you would see people jumping between both namespaces, creating a fact checking primary articles and then adding references on the wikipedia article. But that is to the benefit of everyone; because the arguments about the facts can take place in the reference namespace, not the wikipedia article talk page. I think it would remove a lot of controversy. Clearly there is a lot of junk using this method; but the important thing is it works now, so you don't have to wait for developers to add features. In addition it would be quite easy to ask developers to clean the process up, by amending the [reference] syntax so we could use it also to refer to internal wikipedia links and automatically making them superscript. Developers like easy changes. Allowing referral to internal Wikipedia links would mean the 'What links here' would identify where a primary source was used to support an article allowing for easy fact checking. I can understand your desire for a bracketing syntax and in the best of all worlds it would be the ideal solution. But it is a huge change to make in the code, and so you would need a committed to developer to work on it; and it would be a nightmare to do. It looks very simple when you bracket a specific phrase as a reference; but once people start editing the document you have a nightmare:
It would be hard to find a developer interested in solving that, because it is so difficult. Whatever solution they come up would almost certainly not be liked by everyone. Its a complex requirement and the best way to build towards it is incrementally. It is far simpler to add a reference after the fact you want to reference. People understand this from books; and many people will be careful with the references and indeed check them as they go. Fact checking proper should take place in the reference namespace. From my point of view this project as its first priority needs its own namespace in order to start fact checking and reviewing of primary sources. Once that is set up, people writing articles will soon start referencing the reference namespace and creating articles about suitable primary sources. The method of linking to these references is secondary and a technological problem. :ChrisG 14:40, 6 Oct 2004 (UTC) Submit a proposalI would really like to get started on this project, but I can't tell if there's a consensus for how to do so. Could everyone with an opinion summarize it in 50 words or less, or just add your name after someone else's summary if you agree with it totally? I'm probably going to do something with the music of the United States series using a user subpage to list cite facts. I'll start with music of the United States before 1900 with a reference page at the imaginatively titled User:TUF-KAT/music of the United States before 1900, if someone wants to see it. Tuf-Kat 02:41, Oct 13, 2004 (UTC)
I've already explained my proposed approach above (#A proposed strategy). It is based on the principles of quality assurance. It looks like others here have a very different idea in mind. I think it's epistemologically flawed and I've said as much. -- Jmabel | Talk 07:00, Oct 15, 2004 (UTC)
I agree with this completely. I guess there would be people who format the articles with footnotes, people who check facts, and people who respond directly to 'hot' topics with a lot of conflict, usually the same topics that contain a lot of falsehoods. I'd guess most of the time would be usefully spent in this 3rd catagory, on directing energy at areas that contain falsehoods, like you say. None of this can be effectively done though until a useful way to do footnotes are incoroprated into mediawiki. I suggest a <<bracket approach>>, but I'd take anything :). --ShaunMacPherson 07:53, 15 Oct 2004 (UTC) But, also, merely finding an independent citation does not "prove" the truth of a statement. It's one thing to get articles well-referenced (and I'm all for that, and I probably do about as much in that direction as anyone on the English-language Wikipedia), but it's another to actually be able to say they are anything more than referenced: that requires expertise on the subject matter. See, for an interesting example of how tricky this can be, the notes in Armenian (people) about the references for the population statistics. -- Jmabel | Talk 18:44, Oct 15, 2004 (UTC)
As I see it the reference namespace would be for academic studies, books and any other primary material. Of course, it wouldn't have the source material, it would be articles set up in a structure format about the source material, e.g:
I would imagine different formats for different types of primary material. The talk page could be used to discuss any aspect of the article; but I would presume it would be most useful to discuss the subtleties of referencing and fact checking. A very small percentage of this primary material would also be suitable encylopediac material, which would not be a repetition of the reference namespace, because it would be presented in a different less formal way. I can see a crossover here with various other project ideas, Wikiversity, WikiResearch, WikiReview etc. So perhaps it wouldn't be a reference namespace; but instead its own Wiki with its rules; and probably higher academic standards. I would imagine such a Wiki could well be more appealing to academics than Wikipedia itself. This disadvantage of a seperate Wiki would be that 'What Links Here' would not tell you who was referencing the source material. :ChrisG 23:54, 17 Oct 2004 (UTC) readhow do you get a free acount |