aflat

Data-Driven Techniques in African Language Technology

Data-Driven Techniques in African Language Technology
Project information
Abstract: 

In this project we investigate the applicability of machine learning techniques (supervised and unsupervised methods) to various language technology problems for African languages. 

Hoogeveen, D., & De Pauw G. (2011).  CorpusCollie - A Web Corpus Mining Tool for Resource-Scarce Languages. Proceedings of Conference on Human Language Technology for Development. 44-49. PDF
Kituku, B., Wagacha P. W., & De Pauw G. (2011).  A Memory-Based Approach to Kĩkamba Named Entity Recognition. Proceedings of Conference on Human Language Technology for Development. 106-111. PDF
Chege, K., Ng'Ang'a W., Wagacha P. W., De Pauw G., & Mutiga J. (2011).  Morphological Analysis of Gĩkũyũ using a Finite State Machine. Proceedings of Conference on Human Language Technology for Development. 112-117. PDF
F. Kitoogo, V. Baryamureeba, & De Pauw G. (2008).  Towards Domain Independent Named Entity Recognition. Strengthening the Role of ICT in Development. 38-49.
Wagacha, P. W., De Pauw G., & P.W. Githinji (2006).  A Grapheme-Based Approach for Accent Restoration in Gĩkũyũ. Proceedings of the Fifth International Conference on Language Resources and Evaluation. 1937-1940.
Syndicate content