[PDF] A Topic-Aligned Multilingual Corpus of Wikipedia Articles for PDF 2020.lrec-1.289.pdf

coverage in English Wikipedia (most exhaustive) and Wikipedias in eight other widely spoken languages (Arabic, German, Hindi, Korean, Portuguese, Russian

For example, DBpedia only extracts data from non-English articles that have an interlanguage link3, to an English article 3 http://en wikipedia org/wiki/Help:

[PDF] Korean Linked Data on the Web: Text to RDF - CORE

1 Semantic Web Research Centre, KAIST, Daejeon, South Korea {mrezk,yoon ing Korean entities with Korean Wikipedia and preliminary results evaluating

[PDF] An Approach for Supplementing the Korean Wikipedia - SWRC

An Approach for Supplementing the Korean Wikipedia based on DBpedia Eun- kyung Kim, DongHyun Choi, Jihye Lee, JinHyun Ahn, and Key-Sun Choi

[PDF] Korean NLP2RDF Resources - Association for Computational

The Sejong corpus and its POS tagset (Korean Language Institue, 2012) are how to convert the NLP output to RDF and how to link entities with Wikipedia

[PDF] A Topic-Aligned Multilingual Corpus of Wikipedia Articles for

coverage in English Wikipedia (most exhaustive) and Wikipedias in eight other widely spoken languages (Arabic, German, Hindi, Korean, Portuguese, Russian

Chinese and Korean Cross-Lingual Issue News Detection based on

We propose translation knowledge method for Wikipedia concepts as well as the Chinese and Korean cross-lingual inter-Wikipedia link relations The relevance

Towards Bengali DBpedia - ScienceDirectcom

these are non-Latin character based, like Greek, Korean, Bengali etc Extracting structured data from these versions of Wikipedia is always challenging Though

[PDF] korean writing system

[PDF] korn ferry statistics

[PDF] kosovo patent country code

[PDF] kotlin language javatpoint

[PDF] kpi for employee performance

[PDF] kpi policy and procedure

[PDF] kpi report example

[PDF] kpi template

[PDF] kpis for business

[PDF] kpmg corporate tax rates

[PDF] kpmg pdf 2019

[PDF] kpmg report on digital marketing

[PDF] kpmg report pdf

[PDF] kpop business model

[PDF] kuala lumpur to bangalore malaysia airlines

2379dition to comparing the content of two articles in terms of

phrase overlap, it would be interesting to quantify the dif- ferences in terms offactsthat are present in different lan- guage version of an article. Such a structured metric can help in recommending additions/modification to the editors of the articles in different languages.

6. Bibliographical References

Adafre, S. F. and de Rijke, M. (2006). Finding similar sen- tences across multiple languages in wikipedia. InPro- ceedings of the Workshop on NEW TEXT Wikis and blogs and other dynamic text sources. Adar, E., Skinner, M., and Weld, D. S. (2009). Information arbitrage across multi-lingual wikipedia. InProceedings of the Second ACM International Conference on Web

Search and Data Mining, pages 94-103. ACM.

Balaraman, V., Razniewski, S., and Nutt, W. (2018). Re- coin: Relative completeness in wikidata. InCompanion of the The Web Conference 2018 on The Web Conference

2018, pages 1787-1792. International World Wide Web

Conferences Steering Committee.

Banerjee, S. and Mitra, P. (2015). Wikikreator: Improv- ing wikipedia stubs automatically. InProceedings of the

53rd Annual Meeting of the Association for Computa-

tional Linguistics and the 7th International Joint Confer- ence on Natural Language Processing (Volume 1: Long

Papers), volume 1, pages 867-877.

Bao, P., Hecht, B., Carton, S., Quaderi, M., Horn, M., and Gergle, D. (2012). Omnipedia: bridging the wikipedia language gap. InProceedings of the SIGCHI Conference on Human Factors in Computing Systems, pages 1075-

1084. ACM.

Barr ´on-Cedeno, A., Paramita, M.L., Clough, P., andRosso, P. (2014). A comparison of approaches for measuring cross-lingual similarity of wikipedia articles. InEuro- pean Conference on Information Retrieval, pages 424-

429. Springer.

Callahan, E. S. and Herring, S. C. (2011). Cultural bias in wikipedia content on famous persons.Journal of the American society for information science and technol- ogy, 62(10):1899-1915. Carmel, D., Roitman, H., and Zwerdling, N. (2009). En- hancing cluster labeling using wikipedia. InProceed- ings of the 32Nd International ACM SIGIR Conference on Research and Development in Information Retrieval,

SIGIR "09, pages 139-146, New York, NY, USA. ACM.

Cosley, D., Frankowski, D., Terveen, L., and Riedl, J. (2007). Suggestbot: using intelligent task routing to help people find work in wikipedia. InProceedings of the

12th international conference on Intelligent user inter-

faces, pages 32-41. ACM. Ferr ´andez, S., Toral, A., Ferr´andez, O., Ferr´andez, A., and Munoz, R. (2007). Applying wikipedia"s multilingual knowledge to cross-lingual question answering. InIn- ternational Conference on Application of Natural Lan- guage to Information Systems, pages 352-363. Springer. Filatova, E. (2009). Directions for exploiting asymmetries in multilingual wikipedia. InProceedings of the Third

International Workshop on Cross Lingual InformationAccess: Addressing the Information Need of Multilin-

gual Societies, pages 30-37. Association for Computa- tional Linguistics. Hale, S. A. (2014). Multilinguals and wikipedia editing. InProceedings of the 2014 ACM conference on Web sci- ence, pages 99-108. ACM. Hecht, B. and Gergle, D. (2009). Measuring self-focus bias in community-maintained knowledge repositories. InProceedings of the fourth international conference on

Communities and technologies, pages 11-20. ACM.

Hecht, B. and Gergle, D. (2010). The tower of babel meets web 2.0: user-generated content and its applications in a multilingual context. InProceedings of the SIGCHI con- ference on human factors in computing systems, pages

291-300. ACM.

Hieber, F. and Riezler, S. (2015). Bag-of-words forced de- coding for cross-lingual information retrieval. InPro- ceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguis- tics: Human Language Technologies, pages 1172-1182. Inoue, G., Habash, N., Matsumoto, Y., and Aoyama, H. (2018). A parallel corpus of Arabic-Japanese news arti- cles. InProceedings of the Eleventh International Con- ference on Language Resources and Evaluation (LREC

2018), Miyazaki, Japan, May. European Language Re-

sources Association (ELRA). Katz, G., Shtock, A., Kurland, O., Shapira, B., and Rokach, L. (2014). Wikipedia-based query performance predic- tion. InProceedings of the 37th International ACM SI- GIR Conference on Research & Development in In- formation Retrieval, SIGIR "14, pages 1235-1238, New

York, NY, USA. ACM.

Kunchukuttan, A., Mehta, P., andBhattacharyya, P. (2018).

The IIT Bombay English-Hindi parallel corpus. In

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018),

Miyazaki, Japan, May. European Language Resources

Association (ELRA).

Lin, C.-Y. (2004). Rouge: A package for automatic eval- uation of summaries. InText Summarization Branches

Out: Proceedings of the ACL-04 Workshop.

Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. (2002). Bleu: A method for automatic evaluation of machine translation. InProceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL "02, pages 311-318, Stroudsburg, PA, USA. Association for

Computational Linguistics.

Paramita, M. L., Clough, P. D., and Gaizauskas, R. J. (2017). Using section headings to compute cross-lingual similarity of wikipedia articles. InAdvances in Informa- tion Retrieval - 39th European Conference on IR Re- search, ECIR 2017, Aberdeen, UK, April 8-13, 2017,

Proceedings, pages 633-639.

Park, S., Kim, S., Hale, S. A., Kim, S., Byun, J., and Oh, A. (2015). Multilingual wikipedia: Editors of primary language contribute to more complex articles. InNinth International AAAI Conference on Web and Social Me- dia. Pfeil, U., Zaphiris, P., and Ang, C. S. (2006). Cultural dif- of Computer-Mediated Communication, 12(1):88-113. Potthast, M., Stein, B., and Anderka, M. (2008). A wikipedia-based multilingual retrieval model. InAd- vances in Information Retrieval , 30th European Confer- ence on IR Research, ECIR 2008, Glasgow, UK, March

30-April 3, 2008. Proceedings, pages 522-530.

Schwenk, H., Chaudhary, V., Sun, S., Gong, H., and Guzm ´an, F. (2019). Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia.CoRR, abs/1907.05791. Shirakawa, M., Nakayama, K., Hara, T., and Nishio, S. (2013). Probabilistic semantic similarity measurements for noisy short texts using wikipedia entities. InPro- ceedings of the 22nd ACM International Conference on Information & Knowledge Management, CIKM "13, pages 903-908, New York, NY, USA. ACM. Stvilia, B., Twidale, M. B., Smith, L. C., and Gasser, L. (2008). Information quality work organization in wikipedia.Journal of the American Society for Informa- tion Science and Technology, 59(6):983-1001. Wang, P. and Li, X. (2019). Assessing the quality of infor- mation on wikipedia: A deep-learning approach.Jour- nal of the Association for Information Science and Tech- nology, 0(0). Wolk, K. and Marasek, K. (2015). Building subject- aligned comparable corpora and mining it for truly par- allel sentence pairs.CoRR, abs/1509.08881. Wulczyn, E., West, R., Zia, L., and Leskovec, J. (2016). Growing wikipedia across languages via recommenda- tion. InProceedings of the 25th International Confer- ence on World Wide Web, pages 975-985. International

World Wide Web Conferences Steering Committee.

Zhang, B., Lin, Y., Pan, X., Lu, D., May, J., Knight, K., and Ji, H. (2018). Elisa-edl: A cross-lingual entity extrac- tion, linking and localization system. InProceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demon- strations, pages 41-45.quotesdbs_dbs21.pdfusesText_27

[PDF] [PDF] A Topic-Aligned Multilingual Corpus of Wikipedia Articles for

[PDF] Towards a Korean DBpedia and an Approach for - CEUR-WSorg