Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/8824
Full metadata record
DC Field | Value | Language
dc.contributor.author | Wirote Aroonmanakun | -
dc.contributor.other | Chulalongkorn University. Faculty of Arts | -
dc.date.accessioned | 2009-02-18T02:37:59Z | -
dc.date.available | 2009-02-18T02:37:59Z | -
dc.date.issued | 2007 | -
dc.identifier.citation | Manusya. 13,[Special Issue],4-17 | -
dc.identifier.uri | http://cuir.car.chula.ac.th/handle/123456789/8824 | -
dc.description.abstract | This paper reports on the progress of Thai National Corpus development. The TNC is designed as a general corpus of standard Thai. Only written texts are collected in the first phase. It aims to include at least eighty million words. Various text types produced by various authors are included in the TNC so that it would closely represent written language in general. Texts are word segmented and tagged following the Text Encoding Initiative (TEI) guidelines on text encoding. The TNC was designed as a resource for general applications, such as lexicography, language teaching, and linguistic research. In addition, the TNC is designed to be comparable to the British National Corpus so that a comparative study between the two languages is also possible. | en
dc.format.extent | 311 bytes | -
dc.format.mimetype | text/html | -
dc.language.iso | en | es
dc.publisher | Chulalongkorn University | en
dc.rights | Chulalongkorn University | en
dc.subject | Corpora (Linguistics) | -
dc.subject | Thai National Corpus | -
dc.subject | Thai language | -
dc.subject | Computational linguistics | -
dc.subject | Translating and interpreting -- Data processing | -
dc.title | Creating the Thai National Corpus | en
dc.type | Article | es
dc.email.author | [email protected] | -
dc.description.publication | Aroonmanakun, W. 2007. Creating the Thai National Corpus. Manusya. Special Issue No.13, 4-17. | en
dc.subject.keyword | TNC | en
dc.subject.keyword | corpus linguistics | en
dc.discipline.code | 1016 | es
Appears in Collections: Arts - Journal Articles

Files in This Item:
File | Description | Size | Format
default.html |  | 311 B | HTML

