RISS Academic Research Information Service

      Inverted Index based Modified Version of KNN for Text Categorization


      https://www.riss.kr/link?id=A103734163


      Multilingual Abstract


      This research proposes a new strategy in which documents are encoded into string vectors and a modified version of KNN is made adaptable to string vectors for text categorization. Traditionally, when KNN is used for pattern classification, raw data must be encoded into numerical vectors. This encoding may be difficult, depending on the application area of pattern classification. For example, in text categorization, encoding full texts given as raw data into numerical vectors leads to two main problems: huge dimensionality and sparse distribution. In this research, we encode full texts into string vectors and modify the supervised learning algorithm so that it is adaptable to string vectors for text categorization.
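
      The abstract does not give the encoding rule or the similarity measure, so the following is only a minimal sketch of the general idea under stated assumptions: a string vector is taken to be a text's d most frequent words, word-to-word similarity is taken to be the Jaccard overlap of the two words' posting sets in an inverted index, and vector similarity is the average over all word pairs. The function names (`encode`, `build_inverted_index`, `word_similarity`, `vector_similarity`, `knn_classify`) and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict


def encode(text, d=3):
    """Encode a text as a string vector: its d most frequent words, padded with ''."""
    counts = Counter(text.lower().split())
    # Descending frequency, then alphabetical order, so the result is deterministic.
    ranked = sorted(counts, key=lambda w: (-counts[w], w))
    return tuple((ranked + [""] * d)[:d])


def build_inverted_index(texts):
    """Map each word to the set of indices of the texts that contain it."""
    index = defaultdict(set)
    for i, text in enumerate(texts):
        for word in set(text.lower().split()):
            index[word].add(i)
    return index


def word_similarity(a, b, index):
    """Assumed measure: Jaccard overlap of the two words' posting sets."""
    if not a or not b:
        return 0.0  # padding never matches anything
    if a == b:
        return 1.0
    docs_a, docs_b = index.get(a, set()), index.get(b, set())
    union = docs_a | docs_b
    return len(docs_a & docs_b) / len(union) if union else 0.0


def vector_similarity(u, v, index):
    """Assumed measure: average word similarity over all element pairs."""
    total = sum(word_similarity(a, b, index) for a in u for b in v)
    return total / (len(u) * len(v))


def knn_classify(text, train_texts, train_labels, index, k=3, d=3):
    """Label a text by majority vote among its k most similar training texts."""
    query = encode(text, d)
    scored = sorted(
        ((vector_similarity(query, encode(t, d), index), label)
         for t, label in zip(train_texts, train_labels)),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Toy usage with made-up data.
train_texts = [
    "stock market shares rise",
    "market shares fall stock",
    "football match goal team",
    "team wins football match",
]
train_labels = ["business", "business", "sports", "sports"]
index = build_inverted_index(train_texts)
print(knn_classify("stock market news", train_texts, train_labels, index))
```

      Because similarity is computed on words rather than on numerical features, no high-dimensional sparse vector is ever built; the cost moves into the inverted-index lookups instead.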




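      Journal History

      Date        Type                  Details                                                                                  Listing status
      2023        Evaluation scheduled  Eligible to apply for overseas-DB journal evaluation (overseas-indexed journal evaluation)
      2020-01-01  Evaluation            Accredited journal status retained (overseas-indexed journal evaluation)                 KCI accredited
      2012-01-01  Evaluation            Selected as accredited journal (candidate, 2nd round)                                    KCI accredited
      2011-01-01  Evaluation            Passed candidate 1st round (candidate, 1st round)                                        KCI candidate
      2009-01-01  Evaluation            Selected as candidate journal (new evaluation)                                           KCI candidate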

      Journal Citation Information

      Base year: 2016
      WOS-KCI integrated IF (2-year): 0.09
      KCIF (2-year): 0.09
      KCIF (3-year): 0.09
      KCIF (4-year): 0.07
      KCIF (5-year): 0.06
      Centrality index (3-year): 0.254
      Immediacy index: 0.59
