Wikipedia sentiment analysis

Wikipedia is a Web-based encyclopedia edited as an open wiki.


Researchers

  1. Aaron Halfaker
  2. Andrea Forte
  3. Andrew G. West
  4. Andrew Lih
  5. B. Thomas Adler
  6. Bill Wedemeyer
  7. Chitu Okoli
  8. Daniel Kinzler
  9. Dario Taraborelli
  10. David Milne
  11. Felipe Ortega
  12. Finn Årup Nielsen
  13. Guillaume Paumier
  14. John Riedl
  15. Joseph Michael Reagle Jr.
  16. Luca de Alfaro
  17. Maria Grineva
  18. Ofer Arazy
  19. Paolo Massa
  20. Piotr Konieczny
  21. Reid Priedhorsky
  22. Rut Jesus
  23. Sook Lim
  24. Torsten Zesch

See a further list on the Wikimedia meta-wiki.[1]

Data

Original
API (for English Wikipedia). Raw wiki text, categorizations, lists, ratings, ... 
The official Wikipedia download sites offer, e.g., compressed XML database dumps; see also Amazon Web Services [2].
Official page view counts begun by Domas Mituzas. Also available at 
Editor survey 
Picture of the year. 
Fundraising statistics

See also and
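
As a rough illustration of reading raw wiki text, the sketch below builds the URL for MediaWiki's `action=raw` view of a page and downloads it. The function names, the example page title and the User-Agent string are assumptions made for illustration and do not come from this page.

```python
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def raw_wikitext_url(title, lang="en"):
    """Build the URL that returns the raw wiki text of a page."""
    # MediaWiki uses underscores in place of spaces in page titles.
    params = {"title": title.replace(" ", "_"), "action": "raw"}
    return "https://{}.wikipedia.org/w/index.php?{}".format(lang, urlencode(params))


def fetch_wikitext(title, lang="en"):
    """Download the raw wiki text (requires network access)."""
    request = Request(
        raw_wikitext_url(title, lang),
        # Hypothetical client name; identify your own tool here.
        headers={"User-Agent": "wikipedia-research-example/0.1"},
    )
    with urlopen(request) as response:
        return response.read().decode("utf-8")


# "Motor control" is only an example title.
print(raw_wikitext_url("Motor control"))
```

Calling `fetch_wikitext` performs a network request, so it is left uncalled here. For bulk analysis the database dumps are usually a better fit than many individual requests.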

Derived data

DTU Informatics Cite journal miner (Article x journal)-matrix for scientific citations.
Freebase Wikipedia Extraction (WEX) 
Freebase extracted data from the English Wikipedia [3]
Various datasets. Taxobox - Wikipedia Infoboxes with Taxonomic information on Animal Species [4]. Autocompletion [5]
Statistics on outgoing links from Wikipedia. Statistics for GLAMs are collected as of 2012.
Many Eyes 
[6] [7]
Social Networks from Wikipedia user talk pages [8][2] and activity patterns on Wikipedia after traumatic events [9]
Stanford large network dataset collection 
Complete Wikipedia edit history (up to January 2008) [10] by Gueorgi Kossinets and Jure Leskovec
Dataset created in April 2009 from the social bookmarking site Delicious and Wikipedia. 20,764 unique URLs are available. (Available for research).[3]
Wikipedia Page Traffic Statistics 
Data compiled by Peter N. Skomoroch, Data Wrangling, LLC and used in
English Wikipedia in RDF from categories and internal links.
Wikipedia 5000
Wiki page with the 5000 most visited pages on the English Wikipedia for a 10-day period. It is collected by Andrew G. West.
"Light dumps" for a controversy measure.[4][5]

See also Datamob [11] and meta-wiki [12].
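
Page traffic datasets such as those listed above are typically distributed as plain-text count files. The following is a minimal sketch for ranking pages by views, assuming the space-separated layout of the legacy hourly page-count files (project code, page title, view count, response size). The sample lines and the function name are made up for illustration.

```python
import collections

# Made-up sample lines: project code, page title, view count, bytes served.
SAMPLE_LINES = [
    "en Main_Page 120 4000000",
    "en Motor_control 7 150000",
    "da Forside 15 300000",
    "en Main_Page 80 2600000",
]


def top_pages(lines, project="en", n=2):
    """Sum view counts per page for one project and return the n most viewed."""
    views = collections.Counter()
    for line in lines:
        parts = line.split(" ")
        if len(parts) != 4:
            continue  # skip malformed lines
        code, title, count, _size = parts
        if code == project:
            views[title] += int(count)
    return views.most_common(n)


print(top_pages(SAMPLE_LINES))
# [('Main_Page', 200), ('Motor_control', 7)]
```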

Tools

Toolserver application by Daniel Kinzler: . Display contributors to a specified article in various formats, across language versions and, e.g., grouped by author. Example: [13]
Visualization of edit history [14]
Java Wikipedia Library, a free Java-based application programming interface to Wikipedia. [15].[6]
LA2 Extractor 
Lars Aronsson's Wikipedia template extractor. [16]
" is a quick little tool for extracting sets of pages from a MediaWiki dump file." [17]
Python library interface to Wikipedia
Readability of Wikipedia
[18] Web service that computed the readability of English Wikipedia articles.[7]
Software to analyze a wiki, e.g., with respect to content evolution, activity, users, pages and categories. [19]
Trending Topics 
Web site with view of trends in Wikipedia based on views [20]
Toolserver application by "MZMcBride". Counts the number of editors who are "watch"ing a specified page. Example query: [21]
Python scripts to extract (social) networks from MediaWiki dumps [22]
Example query:
Wikipedia article traffic statistics 
Page view statistics : Statistics in tables and graphs across languages and months based on raw data from
Wikipedia Miner 
Software and a Web service by David Milne and Ian H. Witten. The Web service has thesaurus-like search, Wikipedia article comparison and a wikify method. [24] [25][8]
[26]. See also [27]
Javascript for data from DBpedia
[28] Display anonymous edits on a Google map.
Toolserver application by "MZMcBride". Finds the pages that two or more specified users have all edited.
Tool for archiving wikis [29]
Java API for reading Wikipedia XML dumps,
[30]. Project developed by José Felipe Ortega
Wiwordik is a Java-based visual interface to English and Russian Wiktionary databases.[9]
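
Several of the tools above read MediaWiki XML dumps. As a hedged sketch of the same idea in Python, the code below streams pages from a dump with the standard library. The sample XML is a simplified, made-up fragment in the general shape of the MediaWiki export format, and the helper names are hypothetical.

```python
import io
import xml.etree.ElementTree as ET

# Simplified, made-up fragment shaped like a MediaWiki XML export.
SAMPLE_DUMP = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page>
    <title>Example page</title>
    <revision>
      <timestamp>2008-01-01T00:00:00Z</timestamp>
      <contributor><username>ExampleUser</username></contributor>
      <text>Some '''wiki''' text.</text>
    </revision>
  </page>
</mediawiki>"""


def local_name(tag):
    """Strip the XML namespace, e.g. '{ns}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (title, username, text) per page; the last revision wins."""
    for _, element in ET.iterparse(stream, events=("end",)):
        if local_name(element.tag) != "page":
            continue
        fields = {local_name(child.tag): child.text for child in element.iter()}
        yield fields.get("title"), fields.get("username"), fields.get("text")
        element.clear()  # release the page's subtree; real dumps are very large


pages = list(iter_pages(io.BytesIO(SAMPLE_DUMP)))
print(pages)
```

For a compressed dump, a file object from `bz2.open(path, "rb")` can be passed as `stream` instead of the in-memory sample.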

Defunct

  1. (as of 2010-07-13)

See a further list on Wikimedia meta-wiki[10] as well as the Wikipedia pages on statistics.[11]

Books

  1. Joseph Michael Reagle Jr. (2010). Good faith collaboration: the culture of Wikipedia. The MIT Press.
  2. Phoebe Ayers, Charles Matthews, Ben Yates (2008). How Wikipedia works: and how you can be a part of it. No Starch Press.
  3. Jeffrey T. Orloff (2010). MediaWiki 1.1: beginner's guide. Packt Publishing.
  4. Daniel J. Barrett (2008). Mediawiki (Wikipedia and Beyond). O'Reilly Media.
  5. Yochai Benkler (2006). The Wealth of Networks. Yale University Press.
  6. Andrew Lih (2009). The Wikipedia revolution: how a bunch of nobodies created the world's greatest encyclopedia. Hyperion.
  7. Dan Woods (2007). Wikis for dummies. For Dummies.
  8. John Broughton (2008). Wikipedia: the missing manual. O'Reilly Media.

Theses

  1. Colaboracao Em ... (Portuguese thesis)
  2. Dentistry in Wikipedia - a quantitative and qualitative analysis (Beurteilung der Qualität zahnmedizinischer Einträge in Wikipedia - ein Vergleich mit zahnmedizinischer Fachliteratur), Annette Lorenz
  3. Learning in public: information literacy and participatory media, Andrea Forte
  4. The writing process in online mass collaboration (Johannes Daxenberger, 2016)
  5. Wikipedia: a quantitative analysis, Felipe Ortega

Papers

See also emijrp's List of Wikipedia articles from Google Scholar

Reviews

See also WikiPapers List of survey papers

  1. A brief review of studies of Wikipedia in peer-reviewed journals
  2. Chinese-language literature about Wikipedia: a meta-analysis of academic search engine result pages
  3. Mining meaning from Wikipedia
  4. Research on Wikipedia vandalism: a brief literature review (2016)
  5. The people's encyclopedia under the gaze of the sages: a systematic review of scholarly research on Wikipedia
  6. "The sum of all human knowledge": a systematic review of scholarly research on the content of Wikipedia
  7. Twelve years of Wikipedia research
  8. What we know about Wikipedia. A review of the literature analyzing the project(s)
  9. Wikipedia in the eyes of its beholders: a systematic review of scholarly research on Wikipedia readers and readership
  10. Wikipedia research and tools: review and comments

Original papers

  1. A comparison of privacy issues in collaborative workspaces and social networks
  2. A content-driven reputation system for the Wikipedia
  3. A knowledge-based search engine powered by Wikipedia
  4. A multimethod study of information quality in wiki collaboration
  5. A quantitative approach to the use of the Wikipedia
  6. A quantitative comparison on online encyclopedias-a case study of Wikipedia and Knol
  7. A quantitative study for online encyclopedias: comparison of Wikipedia and Knol
  1. A systemic and cognitive view on collaborative knowledge building with wikis
  2. Piotr Konieczny (2010). "Adhocratic governance in the Internet age: a case of Wikipedia". Journal of Information Technology & Politics 7(4): missing pages. doi: 10.1080/19331681.2010.489408.
  3. Accuracy of cancer information on the internet: a comparison of a wiki with a professionally maintained database
  4. Accuracy and completeness of drug information in Wikipedia: a comparison with standard textbooks of pharmacology
  5. Aggregated trustworthiness: redefining online credibility through social validation
  6. Altmetrics in the wild: Using social media to explore scholarly impact
  7. An analysis of topical coverage of Wikipedia
  8. An effective, low-cost measure of semantic relatedness obtained from Wikipedia links
  9. An evaluation of Wikipedia as a resource for patient education in nephrology
  10. An examination of health, medical and nutritional information on the Internet: a comparative study of Wikipedia, WebMD and the Mayo Clinic websites
  11. Analysis of community structure in Wikipedia
  12. Analysis of the Wikipedia category graph for NLP applications
  13. Analyzing the creative editing behavior of Wikipedia editors: through dynamic social network analysis
  14. Applying the logic of multiple-valued argumentation to social web: SNS and Wikipedia
  15. Article recommendation based on topic model for Wikipedia Selection for Schools
  16. Articulations of WikiWork: uncovering valued work in Wikipedia through barnstars
  17. Assessing the value of cooperation in Wikipedia
  18. Assessment of online patient materials for breast reconstruction
  19. Assigning trust to Wikipedia content
  20. Automated query learning with Wikipedia and genetic programming
  21. Automated text summarization base on lexicales chain and graph using of WordNet and Wikipedia knowledge base (spelling errors in the title?)
  22. Automated creation of Wikipedia articles
  23. Automatic assignment of Wikipedia encyclopedic entries to WordNet synsets
  24. Automatic vandalism detection in Wikipedia
  25. Automatically generating Wikipedia articles: a structure-aware approach
  26. Avoiding tragedy in the wiki-commons
  27. Awarding the self in Wikipedia: identity work and the disclosure of knowledge
  28. Becoming Wikipedian: transformation of participation in a collaborative online encyclopedia
  29. Behavioral aspects in the interaction between Wikipedia and its users
  30. Beyond notability. collective deliberation on content inclusion in Wikipedia
  31. Beyond vandalism: Wikipedia trolls
  32. Beyond Wikipedia: coordination and conflict in online production groups
  33. Biographical social networks on Wikipedia - a cross-cultural study of links that made history
  34. Bieber no more: first story detection using Twitter and Wikipedia
  35. Blogs, Wikipedia, Second Life, and beyond: from production to produsage
  36. Building semantic kernels for text classification using Wikipedia
  37. Can history be Open Source? Wikipedia and the future of the past
  38. Can you ever trust a wiki?: impacting perceived trustworthiness in Wikipedia
  39. Characterizing and modeling the dynamics of online popularity
  40. Circadian patterns of Wikipedia editorial activity
  41. Citations to Wikipedia in chemistry journals: a preliminary study
  42. Classification of recommender expertise in the Wikipedia recommender system
  43. Clustering documents using a Wikipedia-based concept representation
  44. Collaboration in context: Comparing article evolution among subject disciplines in Wikipedia
  45. Collaborative knowledge management: evaluation of automated link discovery in the Wikipedia
  46. Collective intelligence and neutral point of view: the case of Wikipedia
  47. Collective memory building in Wikipedia: the case of North African uprisings
  48. Giacomo Poderi (2009). "Comparing featured article groups and revision patterns correlation in Wikipedia". First Monday 14(5): missing pages. [31].
  49. Computational methods for historical research on Wikipedia's archives
  50. Computing semantic relatedness using wikipedia-based explicit semantic analysis
  51. Computing trust from revision history
  52. Conception d'un outil d'aide à l'indexation de ressources pédagogiques - extraction automatique des thématiques et des mots-clefs de documents UNIT
  53. Conflict and consensus in the Chinese version of Wikipedia
  54. Consistency without concurrency control in large, dynamic systems
  55. Content disputes in Wikipedia reflect geopolitical instability
  56. Content hole search in community-type content
  57. Context and keyword extraction in plain text using a graph representation
  58. Copernicus - 3D Wikipedia
  59. Coordination in collective intelligence: the role of team structure and task interdependence
  60. Creating critical mass in collaboration systems: insights from Wikipedia
  61. Creating, destroying, and restoring value in Wikipedia
  62. Cultural bias in Wikipedia content on famous persons
  63. Credibility judgment and verification behavior of college students concerning Wikipedia
  64. Crossing textual and visual content in different application scenarios
  65. Cultural differences in collaborative authoring of Wikipedia
  66. Decentering design: Wikipedia and indigenous knowledge
  67. Detecting Wikipedia vandalism with active learning and statistical language models
  68. Deriving a large scale taxonomy from Wikipedia
  69. Discovering missing links in Wikipedia
  70. Does it matter who contributes: a study on featured articles in the German Wikipedia
  71. Does Wikipedia information help Netflix predictions?
  72. Does Wikipedia provide evidence-based health care information? A content analysis
  73. Don't look now, but we've created a bureaucracy: the nature and roles of policies and rules in Wikipedia
  74. Drug safety in the digital age
  75. Early response to false claims in Wikipedia
  76. Edit wars in Wikipedia
  77. Encyclopedic knowledge patterns from Wikipedia links
  78. Emotions and dialogue in a peer-production community: the case of Wikipedia
  79. Employing Wikipedia's natural intelligence for cross language information retrieval
  80. Evaluating quality control of Wikipedia's feature articles
  81. Evaluating WikiTrust: a trust support tool for Wikipedia
  82. Evaluation of automatic linking strategies for Wikipedia pages
  83. Experimental study of informal rewards in peer production
  84. Explaining quality in Internet collective goods: Zealots and good Samaritans in the case of Wikipedia
  85. Exploiting Wikipedia as external knowledge for named entity recognition
  86. Exploring linguistic points of view of Wikipedia
  87. Exploring the problems with Wikipedia's editing rule for public relations
  88. Extracting directed information flow networks: an application to genetics and semantics
  89. Extracting lexical semantic knowledge from Wikipedia and Wiktionary
  90. Extracting semantic relationships between wikipedia categories
  91. Fast and faster: a comparison of two streamed matrix decomposition algorithms
  92. Finding social roles in Wikipedia
  93. Forced transparency: corporate image on Wikipedia and what it means for public relations
  94. Free and open-source software is not an emerging property but rather the result of studied design
  95. From Encyclopædia Britannica to Wikipedia
  96. Gender bias in Wikipedia and Britannica
  97. Gender differences in information behavior concerning Wikipedia, an unorthodox information source?
  98. Governance in social media: a case study of the Wikipedia promotion process
  99. Governance of massive multiauthor collaboration - Linux, Wikipedia, and other networks: governed by bilateral contracts, partnerships, or something in between?
  100. Group minds and the case of Wikipedia
  101. Handling flammable materials: Wikipedia biographies of living persons as contentious objects
  102. He says, she says: conflict and coordination in Wikipedia
  103. Hot off the wiki: dynamics, practices, and structures in Wikipedia's coverage of the Tōhoku catastrophes
  104. How today's college students use Wikipedia for course-related research
  105. Identifying and understanding the problems of Wikipedia's peer governance: the case of inclusionists versus deletionists
  106. Improving Wikipedia: educational opportunity and professional responsibility
  107. Individual focus and knowledge contribution
  108. Information quality discussions in Wikipedia
  109. Information quality work organization in Wikipedia
  110. Information uniqueness in Wikipedia articles
  111. Informed investors and the Internet
  112. Intelligence in Wikipedia
  113. Interlinking journal and wiki publications through joint citation: Working examples from ZooKeys and Plazi on Species-ID
  114. Internet encyclopaedias go head to head
  115. Investigations into trust for collaborative information repositories: a Wikipedia case study
  116. Is editing more rewarding than discussion?
  117. Is Wikipedia a relevant model for e-learning?
  118. Is Wikipedia biased?
  119. Is Wikipedia growing a longer tail?
  120. Is Wikipedia unsuitable as a clinical information resource for medical students?
  121. It's a network, not an encyclopedia: a social network perspective on Wikipedia collaboration
  122. Junior physician's use of Web 2.0 for information seeking and medical education: a qualitative study
  123. Large-scale named entity disambiguation based on Wikipedia data
  124. Large-scale taxonomy mapping for restructuring and integrating Wikipedia
  125. Learning to link with Wikipedia
  126. Learning to predict the quality of contributions to Wikipedia
  127. Legitimizing Wikipedia: how U.S. national newspapers frame and use the online encyclopedia in their coverage
  128. Let's leave the bias to the mainstream media: a Wikipedia community fighting for information neutrality
  129. Lifting the veil: improving accountability and social transparency in Wikipedia with WikiDashboard
  130. Linking Wikipedia to the Web
  131. Managing information quality in virtual communities of practice
  132. Manypedia: comparing language points of view of Wikipedia communities
  133. Mapping bilateral information interests using the activity of Wikipedia editors
  134. Membership turnover and collaboration success in online communities: explaining rises and falls from grace in Wikipedia
  135. Measuring adjective spaces
  136. Measuring article quality in Wikipedia: models and evaluation
  137. Measuring public relations Wikipedia engagement: how bright is the rule?
  138. Measuring Wikipedia
  139. Mining meaning from Wikipedia
  140. Modeling and predicting page-view dynamics on Wikipedia
  141. Modeling events with cascades of poisson processes
  142. Modeling the structure and evolution of discussion cascades
  143. Modeling user reputation in wikis
  144. Modeling Wikipedia promotion decisions
  145. Mopping up: modeling Wikipedia promotion decisions
  146. Motivations of contributors to Wikipedia
  147. Motivations of Wikipedia content contributors
  148. Network analysis for Wikipedia
  149. Network analysis of collaboration structure in Wikipedia
  150. On the credibility of Wikipedia: an accessibility perspective
  151. On the evolution of Wikipedia
  152. Open, but how much? Growth, conflict and institutional evolution in Wikipedia and Debian
  153. Open source content contributors' response to free-riding: the effect of personality and context
  154. Patient-oriented cancer information on the Internet: a comparison of Wikipedia and a professionally maintained database
  155. Patterns of revision in online writing: a study of Wikipedia's featured articles
  156. Personality characteristics of Wikipedia members
  157. Philosophy democratized? A comparison between Wikipedia and two other Web-based philosophy resources
  158. Power of the few vs. wisdom of the crowd: Wikipedia and the rise of the bourgeoisie
  159. Preferential attachment in the growth of social networks: the internet encyclopedia Wikipedia
  160. Putting Wikipedia to the test: a case study
  161. Quality assessment of Wikipedia external links
  162. Quality of information sources about mental disorders: a comparison of Wikipedia with centrally controlled web and printed sources
  163. Quantitative analysis of the Wikipedia community of users
  164. Queering Wikipedia
  165. Raising the stakes: writing about witchcraft on Wikipedia
  166. Readability of Wikipedia
  167. Reengineering the Wikipedia for reputation
  168. Representation of medical informatics in the Wikipedia and its perspectives
  169. Reputation and reliability in collective goods: the case of the online encyclopedia Wikipedia
  170. (Re)triggering backlash: responses to news of Wikipedia's gender gap
  171. RESLVE: leveraging user interest to improve entity disambiguation on short text
  172. Revision and co-revision in Wikipedia: detecting clusters of interest
  173. RUOK? blogging communication technologies during crises
  174. rv you're dumb: identifying discarded work in wiki article history
  175. Scaling consensus: increasing decentralization in Wikipedia governance
  176. Schema evolution in Wikipedia: toward a web information system benchmark
  177. Scientometrics 2.0: new metrics of scholarly impact on the social Web
  178. Scope, completeness, and accuracy of drug information in Wikipedia
  179. Seeking health information online: does Wikipedia matter?
  180. Semantic content filtering with Wikipedia and ontologies
  181. Semantic convergence of Wikipedia articles
  182. Semantic relatedness metric for Wikipedia concepts based on link analysis and its application to word sense disambiguation
  183. Semantic Wikipedia
  184. Signed networks in social media
  185. Size matters: word count as a measure of quality on Wikipedia
  186. So you know you're getting the best possible information: a tool that increases Wikipedia credibility
  187. Social capital increases efficiency of collaboration among Wikipedia editors
  188. Social networks of Wikipedia
  189. Social rewarding in wiki systems - motivating the community
  190. Space and time as a primary classification criterion for information retrieval in distributed social networking
  191. Spatio-temporal analysis of Wikipedia metadata and the STiki anti-vandalism tool
  192. Studying cooperation and conflict between authors with history flow visualizations
  193. SuggestBot: using intelligent task routing to help people find work in Wikipedia
  194. Summaries of Wikipedia deletion discussions
  195. Studying cooperation and conflict between authors with history flow visualisations
  196. Taking up the mop: identifying future Wikipedia administrators
  197. Talk before you type: coordination in Wikipedia
  198. Teaching Wikipedia as a mirrored technology
  199. Temporal analysis of text data using latent variable models
  200. Temporal analysis of the Wikigraph
  201. Ten simple rules for editing Wikipedia
  202. The climate change controversy through 15 articles of Portuguese Wikipedia
  203. The correlation between Wikipedia and knowledge sharing on job performance
  204. The democratization of information? Wikipedia as a reference resource
  205. The dynamic nature of conflict in Wikipedia
  206. The effects of group composition on decision quality in a social production community
  207. The evolution of genre in Wikipedia
  208. The hidden order of Wikipedia
  209. The implications of Wikipedia for contemporary science education: using social network analysis techniques for automatic organisation of knowledge
  210. The influence of diversity and experience on the effects of crowd size
  211. The past, present, and future of Wikipedia
  212. The pathology informatics curriculum wiki: harnessing the power of user-generated content
  213. The quality of open access and open source internet material in gastroenterology: is Wikipedia appropriate for knowledge transfer to patients?
  214. The rise and decline of an open collaboration system: how Wikipedia's reaction to popularity is causing its decline
  215. The RNA WikiProject: community annotation of RNA families
  216. The role of conflict in determining consensus on quality in Wikipedia articles
  217. The social roles of bots and assisted editing programs
  218. The visibility of Wikipedia in scholarly publications
  219. The vocal minority: local self-representation and co-editing on Wikipedia in the Middle East and North Africa
  220. The Wikipedia XML corpus
  221. The Wikipediazation of the American Judiciary
  222. The WikiPhil Portal: visualizing meaningful philosophical connections
  223. The work of sustaining order in Wikipedia: the banning of a vandal
  224. There exist correlations between editing behaviors and hyperlinks structure in Wikipedia
  225. Time evolution of Wikipedia network ranking
  226. Topic indexing with Wikipedia
  227. Transformation of Wiktionary entry structure into tables and relations in a relational database schema
  228. Transparency and social responsibility issues for Wikipedia
  229. Understanding collaboration in Wikipedia
  230. User-generated content on the Internet: an examination of gratifications, civic engagement and psychological empowerment
  231. Using encyclopedic knowledge for named entity disambiguation
  232. Using links to classify Wikipedia pages
  233. Using Wikipedia at the TREC QA track
  234. Using Wikipedia categories for ad hoc search
  235. Using Wikipedia for automatic word sense disambiguation
  236. Using Wikipedia links to construct word segmentation corpora
  237. Using Wikipedia to extend digital collections
  238. Value production in a collaborative environment
  239. Vandals, administrators, and sockpuppets, oh my! An ethnographic study of Wikipedia's handling of problem behavior
  240. Visitors and contributors in Wikipedia
  241. Web 2.0 according to Wikipedia: capturing an organizing vision
  242. Web science 2.0: identifying trends through semantic social network analysis
  243. What is popular on Wikipedia and why?
  244. What is the quality of surgery-related information on the Internet? Lessons learned from a standardized evaluation of 10 common operations
  245. What it means to ban Wikipedia: an exploration of the pedagogical principles at stake
  246. What motivates Wikipedians?
  247. What's in Wikipedia? mapping topics and conflict using socially annotated category structure
  248. What's on Wikipedia, and what's not...? assessing completeness of information
  249. When the Wikipedians talk: network and tree structure of Wikipedia discussion pages
  250. Isto Huvila (2010). "Where does the information come from? information source use patterns in Wikipedia". Information Research missing volume: 433.
  251. Why is Wikipedia not more widely accepted in Korea and China? factors affecting knowledge-sharing adoption
  252. Why ornithologists should embrace and contribute to Wikipedia
  253. Why Wikipedia: self-efficacy and self-esteem in a knowledge-political battle for an egalitarian epistemology
  254. Why you can't cite Wikipedia in my class
  255. Wiki trust metrics based on phrasal analysis
  256. Wiki-surgery? Internal validity of Wikipedia as a medical and surgical reference
  257. Wikidentities: Young people collaborating on virtual identities in social network sites
  258. Wikipedia - a key tool for global public health promotion
  259. Wikipedia and institutional repositories: an academic symbiosis?
  260. Wikipedia and osteosarcoma: a trustworthy patients' information?
  261. Wikipedia and the semantic web--the missing links
  262. Wikipedia and the two-faced professoriate
  263. Wikipedia as a data source for political scientists: accuracy and completeness of coverage
  264. Wikipedia as a tool for forestry outreach
  265. Wikipedia as participatory journalism: reliable sources? metrics for evaluating collaborative media as a news resource
  266. Wikipedia - free and reliable?: aspects of a collaboratively shaped encyclopaedia
  267. Wikipedia leeches? The promotion of traffic through a collaborative Web format
  268. Wikipedia, past and present
  269. Wikipedia popularity from a citation analysis point of view
  270. Wikipedia Revision Toolkit: efficiently accessing Wikipedia's edit history
  271. Wikipedia risks
  272. Wikipedia: the educator’s friend (!)
  273. 'Wikipedia, the free encyclopedia' as a role model? Lessons for open innovation from an exploratory examination of the supposedly democratic-anarchic nature of Wikipedia
  274. Wikipedia: the free encyclopedia
  275. Wikipedia usage estimates prevalence of influenza-like illness in the United States in near real-time
  276. Wikipedia vandalism detection: combining natural language, metadata, and reputation features
  277. Wikipedian self-governance in action: motivating the policy lens
  278. Wikipedia edit number prediction based on temporal dynamics only
  279. Wikipedia's labor squeeze and its consequences
  280. WikiRelate! Computing semantic relatedness using Wikipedia
  281. Wikis and Wikipedia as a teaching tool
  282. Wikis and Wikipedia as a teaching tool: five years later
  283. Wikiversity; or education meets the free culture movement: An ethnographic investigation
  284. WP:Clubhouse? an exploration of Wikipedia's gender imbalance
  285. Writing, citing, and participatory media: wikis as learning environments in the high school classroom
  286. WRS: the Wikipedia Recommender System
  287. YAWN: a semantically annotated Wikipedia XML corpus
  288. YAGO: a core of semantic knowledge unifying WordNet and Wikipedia
  289. You are where you edit: locating Wikipedia contributors through edit histories
  290. "You just type in what you are looking for": undergraduates' use of library resources vs. Wikipedia

For publications related to Semantic Web and Wikipedia see also the bibliography at

Python

Count words in Wikipedia page

>>> import collections, mwparserfromhell, nltk, requests
>>> url = ''
>>> response = requests.get(url)
>>> text = mwparserfromhell.parse(response.content).strip_code()
>>> collections.Counter(nltk.word_tokenize(text)).most_common(5)
[(u'the', 38), (u'of', 18), (u'.', 15), (u'motor', 13), (u'to', 9)]

Count "cognitive" words in Wikipedia page

>>> from import CognitiveWords
>>> cogwords = CognitiveWords()
>>> collections.Counter(cogwords.find_all(text)).most_common(5)
[(u'motor', 17), (u'category', 3), (u'motor control', 2), (u'motor skill', 1), (u'movement', 1)]

Note that 'category' is a false match that comes from the categorization tag. The tag prefix can be removed before counting:

>>> import re
>>> collections.Counter(cogwords.find_all(re.sub('^Category:', '', text, flags=re.MULTILINE))).most_common(5)
[(u'motor', 17), (u'motor control', 2), (u'motor skill', 1), (u'movement', 1)]
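
The same counting idea can be sketched with only the Python 3 standard library, which may help when `nltk` or `mwparserfromhell` is not installed. This variant is an assumption-laden simplification: it uses a regular-expression tokenizer rather than `nltk.word_tokenize`, lowercases all tokens, and runs on a made-up sample string instead of a downloaded page. The function name `count_words` is hypothetical.

```python
import collections
import re


def count_words(text, n=5):
    """Return the n most common lowercase word tokens in text."""
    # Drop the 'Category:' prefix left at line starts by the categorization tag.
    cleaned = re.sub(r"^Category:", "", text, flags=re.MULTILINE)
    # Runs of letters only; punctuation and digits are not counted as words.
    tokens = re.findall(r"[^\W\d_]+", cleaned.lower())
    return collections.Counter(tokens).most_common(n)


# Made-up sample standing in for the stripped text of a page.
sample = "Motor control is the control of movement.\nCategory:Motor control"
print(count_words(sample, 2))
# [('control', 3), ('motor', 2)]
```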

External links

  1. — list of academic studies.

Blog posts

  2. Beyond reliability: An ethnographic study of Wikipedia sources, Heather Ford.

References

  2. Paolo Massa (2011). "Social networks of Wikipedia". ACM Hypertext 2011: 22nd ACM Conference on Hypertext and Hypermedia.
  3. Arkaitz Zubiaga (2009). "Enhancing navigation on Wikipedia with social tags".
  4. Dynamics of conflicts in Wikipedia
  5. Edit wars in Wikipedia
  6. Torsten Zesch, Christof Mueller, Iryna Gurevych (2008). "Extracting lexical semantic knowledge from Wikipedia and Wiktionary". Proceedings of the Conference on Language Resources and Evaluation (LREC).
  7. Readability of Wikipedia
  8. David Milne. "An open-source toolkit for mining Wikipedia". [1]
  9. A. A. Krizhanovsky, F. Lin. "Related terms search based on WordNet / Wiktionary and its application in Ontology Matching".
