http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.
변환된 중국어를 복사하여 사용하시면 됩니다.
The SVM based Uyghur Text Classification and its Performance Analysis
Palidan Tuerxun,Fang Dingyi,Askar Hamdulla 보안공학연구지원센터 2015 International Journal of Multimedia and Ubiquitous Vol.10 No.4
This paper mainly explores the use of Support Vector Machines (SVMs) for Uyghur text classification, presents the process of text categorization: Text preprocessing, feature dimensionality reduction, representation method and classification of text features etc., discusses the SVMs classification algorithm in the application of Uyghur text classification. Focus on the construction of text categorization model and its procedures. Experiment results show that training by using the selected training data with the guarantee of the performance of the classifier, has higher efficiency than other nearest neighbor classifier (KNN), Naive Bayes (NB) classifier with increased accuracy.
Li, Xun,Yang, Shun-E.,Guo, Yun-Quan,Shen, Ming-Xia,Gu, Li,Gulikezi, Gulikezi,Zhao, Bing,Liu, Wei,Tuerxun, Tuerxun,Bai, Jing-Ping Asian Pacific Journal of Cancer Prevention 2012 Asian Pacific journal of cancer prevention Vol.13 No.12
Objective: To investigate the relationship between plasma EBV-DNA concentration and clinicopathologic features of Hodgkin's lymphoma cases. Methods: At first, the positive rate of plasma EBV-DNA was determined with a nested-PCR method using 45 specimens from Uygur HL patients, as well as 110 healthy people sampled as normal controls. Secondly, using fluorescent quantitative nested-PCR, EBV viral load was assessed in the EBV-DNA positive plasma samples. Then, relationships between plasma EBV viral load and clinicopathologic features of HL patients were analyzed. Results: The positive rate of plasma EBV-DNA of HL patients was significantly higher than that of normal controls (53.3% vs 26.4%, P=0.001). There was no significant difference about plasma EBV viral load between EBV-associated HL and EBV-DNA positive normal people (P=0.490). Looking at patients' characteristics, plasma EBV viral load in 10-20 years EBV-associated HL was higher than in EBV cases which were less than 10 years or more than 35 years (P=0.025). Furthermore, in EBV-associated HL, concentration of plasma EBV-DNA was significantly higher in advanced stage disease (stages III-IV; P=0.013), and with B-symptoms (P=0.020). Conclusion: EBV-DNA levels were associated with part of clinicopathologic features of cases. It was of practical use to screen HL. Further etiological studies appear warranted.
A Survey of Uyghur Person Name Recognition
Tashpolat Nizamidin,Palidan Tuerxun,Askar Hamdulla,Muhtar Arkin 보안공학연구지원센터 2016 International Journal of Signal Processing, Image Vol.9 No.3
Uyghur is one of the most populous and civilized groups with Turkic ethnicity and mainly located Xinjiang Uyghur Autonomous Region of China. Uyghur language belongs to the Karluk branch of the Turkic language family in Altaic language system, and holds agglutinative characteristics in morphological structure. Named Entity Recognition (NER) is an Information Extraction task that has become an essential part of Natural Language Processing (NLP) tasks, such as Machine Translation and Information Retrieval. In this paper, as a subtask of NER, the importance of Uyghur Named Entity Recognition (UPNR) task is demonstrated, the main characteristics of the Uyghur language are highlighted, and the aspects of standardization in annotating named entities are illustrated. Moreover, the approaches used in Uyghur NPNR field are explained and the features of common tools used in Uyghur NPNR are described. A brief review of the state of the art of Uyghur NPNR research is discussed, too. Finally, we present our conclusions. Throughout the presentation, illustrative examples are used for clarification.
Uyghur Stemming and Lemmatization Approach based on Multi-Morphological Features
Abdurahim Mahmoud,Sediyegvl Enwer,Abdusalam Dawut,Palidan Tuerxun,Askar Hamdulla 보안공학연구지원센터 2016 International Journal of Database Theory and Appli Vol.9 No.11
This paper describes a stemming and lemmatization approach for Uyghur using Conditional Random Fields (CRFs). In the proposed approach, we used syllable-level training and test corpus with the combination of some automatically tagged positional and morphological feature tags. The training and test corpus has been manually tagged with a stemming tag set which includes eight kinds of tags which fully reflect the morphological feature of Uyghur word. It has been observed that some morphological features are very helpful for improving the evaluating results. The syllable-level Precision, Recall and F-score of the best evaluation result respectively are 98.79%,98.71% and 98.75% respectively, and the word-level accuracy we achieved is 95.9%.The experimental results show that the efficiency of this approach is very ideal.
Xiaoyan Hao,Jianping Li,Shengqi Gao,Zumuremu Tuerxun,Xiaochun Chang,Wenran Hu,Guo Chen,Quansheng Huang 한국유전학회 2020 Genes & Genomics Vol.42 No.12
Background: Abiotic stress effects agricultural production, so research on improving stress tolerance of crop is important. Suaeda salsa is a halophyte with high salt and drought tolerance and ability to desalinate saline soil and improve soil quality. Objective: To discover and utilize of salt and drought tolerance-related genes, we further investigated the mechanisms of salt and drought tolerance. Methods: Through screening a salt treated Suaeda salsa cDNA library and further cloning a H subunit of the photosystem I reaction center SsPsaH cDNA, and then the protein domain and phylogenetic analyses of PSI genes was conducted with the NCBI Blast, DNAMAN, and MotifScan programs. The S. salsa seedlings were subjected to various stress treatments and analyze expression of SsPsaH under these treatments by real-time RT-PCR. SsPsaH expression construct was introduced into S. pombe cells by electroporation and transformed into N. tabacum plants by the leaf disc transformation method. Results: A member of the H subunit of the Photosystem I reaction center (defined as SsPsaH) was obtained. The expression of SsPsaH was up-regulated by abscisic acid (ABA), salt, and drought stress treatments. Over-expressing SsPsaH in recombinant yeasts enhanced high salinity tolerance and increased tolerance to sorbitol during seed germination and seedling root development in tobacco, respectively. Some stress-related mark genes such as a LEA family gene of NtLEA, a binding protein of a drought response element of NtDREB, the ascorbate peroxidase gene (NtAPX) were also up-regulated in SsPsaH overexpressing transgenic tobacco lines. Conclusions: These results show that SsPsaH may contribute to the salt and osmotic stress response of plants.