Home
 Publications
 Projects
 Past Students
 Teaching
 NLP Toolkit
 Contact
 
 
 
 
 
 
 

About

NLP Toolkit contains implementations of several Natural Language Processing and Machine Learning algorithms. Although initial implementations are based on Turkish language, the system currently contains basic modeling of 3 languages, namely English, Turkish, and Persian.

Algorithms

The algorithms implemented are

Data Sets

The system currently supports reading modules for

References

  1. Ercan, G., O. Erkek, O. Acikgoz, R. Ozcelik, S. Parlar, O. T. Yildiz, "Anlamsal Söylem ve Cümle Benzerliği Analizleri için Veri Kümesi Oluşturma Yöntemi", International Conference on Computer Science and Engineering (UBMK) Sarajevo, Bosnia, 2018.
  2. Ak, K., O. Bakay, O. T. Yildiz, "Comparison of Turkish Proposition Banks by Frame Matching", International Conference on Computer Science and Engineering (UBMK) Sarajevo, Bosnia, 2018.
  3. Ercan, G., O. T. Yildiz, "AnlamVer: Semantic Model Evaluation Dataset for Turkish - Word Similarity and Relatedness", International Conference on Computational Linguistics (COLING) (Best Paper Award), pp. 3819-3836, Santa Fe, USA, 2018.
  4. Ehsani, R., E. Solak, O. T. Yildiz, "Constructing a WordNet for Turkish Using Manual and Automatic Annotation", ACM Transactions on Asian and Low-Resource Language Information Processing, Vol. 17, No. 3, Article 24, 2018.
  5. Ak, K., C. Toprak, V. Esgel, O. T. Yildiz, "Construction of a Turkish Proposition Bank", Turkish Journal of Electrical Engineering & Computer Sciences, Vol. 26, No. 1, pp. 570-581, 2018.
  6. Akcakaya, S., O. T. Yildiz, "An All-Words Sense Annotated Turkish Corpus", International Conference on Natural Language and Speech Processing (ICNLSP), Algiers, Algeria, 2018.
  7. Yildiz, O. T., K. Ak, G. Ercan, O. Topsakal, C. Asmazoglu, "A Multilayer Annotated Corpus for Turkish", International Conference on Natural Language and Speech Processing (ICNLSP), Algiers, Algeria, 2018.
  8. Ertopcu, B., A. B. Kanguroglu, O. Topsakal, O. Acikgoz, A. T. Gurkan, B. Ozenc, I. Cam, B. Avar, G. Ercan, O. T. Yildiz, "A New Approach for Named Entity Recognition", International Conference on Computer Science and Engineering (UBMK), pp. 474-479, Antalya, Turkey, 2017.
  9. Topsakal, O., O. Acikgoz, A. T. Gurkan, A. B. Kanguroglu, B. Ertopcu, B. Ozenc, I. Cam, B. Avar, G. Ercan, O. T. Yildiz, "Shallow Parsing in Turkish", International Conference on Computer Science and Engineering (UBMK), pp. 480-485, Antalya, Turkey, 2017.
  10. Acikgoz, O., A. T. Gurkan, B. Ertopcu, O. Topsakal, B. Ozenc, A. B. Kanguroglu, I. Cam, B. Avar, G. Ercan, O. T. Yildiz, "All-Words Word Sense Disambiguation for Turkish", International Conference on Computer Science and Engineering (UBMK), pp. 490-495, Antalya, Turkey, 2017.
  11. Sasmaz, E., R. Ehsani, O. T. Yildiz, "Hypernym extraction from Wikipedia and Wiktionary", Signal Processing and Communication Applications Conference (SIU), Antalya, Turkey, 2017.
  12. Ehsani, R., O. T. Yildiz, "Initial Efforts in Creating a Persian-English Parallel Treebank", International Conference on Intelligent Text Processing and Computational Linguistics (CICLING), Budapest, Hungary, 2017.
  13. Ehsani, R., E. Solak, O. T. Yildiz, "Hybrid Chunking for Turkish Combining Morphological and Semantic Features", International Conference on Intelligent Text Processing and Computational Linguistics (CICLING), Budapest, Hungary, 2017.
  14. Gorgun, O., O. T. Yildiz, E. Solak, R. Ehsani, "English-Turkish Parallel Treebank with Morphological Annotations and its Use in Tree-based SMT", International Conference on Pattern Recognition and Methods (ICPRAM), pp. 510-516, Rome, Italy, 2016.
  15. Solak, E., O. T. Yildiz, O. Gorgun, R. Ehsani, "Attachment Errors of Nouns after Possessor Clitic", Research in Computing Science, Vol. 90, pp. 173-181, 2015.
  16. Yildiz, O. T., S. Candir, E. Solak, R. Ehsani, O. Gorgun, "Constructing a Turkish Constituency Parse TreeBank", International Conference on Computer and Information Sciences (ISCIS), pp. 339-347, Krakow, Poland, 2015.
  17. Yildiz, O. T., E. Solak, R. Ehsani, O. Gorgun, "Chunking in Turkish with Conditional Random Fields", International Conference on Intelligent Text Processing and Computational Linguistics (CICLING), Cairo, Egypt, 2015.
  18. Duzagac, R., O. T. Yildiz, "Context Sensitive Search Engine", International Conference on Computer and Information Sciences (ISCIS), pp. 277-284, Krakow, Poland, 2014.
  19. Yildiz, O. T., E. Solak, O. Gorgun, R. Ehsani, "Constructing a Turkish-English Parallel Treebank", Annual Meeting of the Association for Computational Linguistics (ACL), Baltimore, U.S.A., 2014.
  20. Yildiz, O. T., A. Okutan, E. Solak, "Bilingual Software Requirements Tracing using Vector Space Model", International Conference on Pattern Recognition and Methods (ICPRAM), Angers, France, 2014.
  21. Gorgun, O, O. T. Yildiz, "Using Morphology In English-Turkish Statistical Machine Translation", Signal Processing and Communication Applications Conference (SIU), Antalya, Turkey, 2012.
  22. Gorgun, O, O. T. Yildiz, "A Novel Approach to Morphological Disambiguation for Turkish", International Conference on Computer and Information Sciences (ISCIS), pp. 77-83, London, UK, 2011.
  23. Ak, K, O. T. Yildiz, "Unsupervised Morphological Analysis Using Tries", International Conference on Computer and Information Sciences (ISCIS), pp. 69-75, London, UK, 2011.