딥러닝 기반 한국 고전한문 표점 추론 자동화 모델의 구축과 활용|RISS 상세보기

다국어 입력

あぁかがさざただなはばぱまやゃらわゎんいぃきぎしじちぢにひびぴみりうぅくぐすずつづっぬふぶぷむゆゅるえぇけげせぜてでねへべぺめれおぉこごそぞとどのほぼぽもよょろを

アァカサザタダナハバパマヤャラワヮンイィキギシジチヂニヒビピミリウゥクグスズツヅッヌフブプムユュルエェケゲセゼテデヘベペメレオォコゴソゾトドノホボポモヨョロヲ ―

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)

中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.

ㅥ ㅦ ㅧ ㅨ ㅩ ㅪ ㅫ ㅬ ㅭ ㅮ ㅯ ㅰ ㅱ ㅲ ㅳ ㅴ ㅵ ㅶ ㅷ ㅸ ㅹ ㅺ ㅻ ㅼ ㅽ ㅾ ㅿ ㆀ ㆁ ㆂ ㆃ ㆄ ㆅ ㆆ ㆇ ㆈ ㆉ ㆊ ㆋ ㆌ ㆍ ㆎ

Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο Π Ρ Σ Τ Υ Φ Χ Ψ Ω α β γ δ ε ζ η θ ι κ λ μ ν ξ ο π ρ σ τ υ φ χ ψ ω

á à Á À é è É È ç Ç ê

Ä Ö Ü ä ö ü ß

ְ ֳ ֲ ֱ ָ ַ ֵ ֶ ִ ֹ ּ ֻ ׂ ׁ ּ פ ם ן ו ט א ר ק ף ך ל ח י ע כ ג ד ש ץ ת צ מ נ ה ב

‘ ’ “ ” 〔〕〈〉「」『』【】＂（）［］｛｝

± × ÷ ≠ ≤ ≥ ∞ ∴ ♂ ♀ ∠ ⊥ ⌒ ∂ ∇ ≡ ≒ ≪ ≫ √ ∽ ∝ ∵ ∫ ∬ ∈ ∋ ⊆ ⊇ ⊂ ⊃ ∪ ∩ ∧ ∨ ￢ ⇒ ⇔ ∀ ∃ ∮ ∑ ∏ ＋－＜＝＞

、。 · ‥ … ¨ 〃 ― ∥ ＼ ∼ ´ ～ ˇ ˘ ˝ ˚ ˙ ¸ ˛ ¡ ¿ ː ！＇，．／：；？＾＿｀｜

½ ⅓ ⅔ ¼ ¾ ⅛ ⅜ ⅝ ⅞ ¹ ² ³ ⁴ ⁿ ₁ ₂ ₃ ₄

Æ Ð Ħ Ĳ Ł Ø Œ Þ Ŧ Ŋ æ đ ð ħ ı ĳ ĸ ŀ ł ø œ ß þ ŧ ŋ ŉ

А Б В Г Д Е Ё Ж З И Й К Л М Н О П Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я а б в г д е ё ж з и й к л м н о п р с т у ф х ц ч ш щ ъ ы ь э ю я

′ ″ ℃ Å ￠￡￥ ¤ ℉ ‰ ＄％Ｆ￦㎕㎖㎗ ℓ ㎘㏄㎣㎤㎥㎦㎙㎚㎛㎜㎝㎞㎟㎠㎡㎢㏊㎍㎎㎏㏏㎈㎉㏈㎧㎨㎰㎱㎲㎳㎴㎵㎶㎷㎸㎹㎀㎁㎂㎃㎄㎺㎻㎽㎾㎿㎐㎑㎒㎓㎔ Ω ㏀㏁㎊㎋㎌㏖㏅㎭㎮㎯㏛㎩㎪㎫㎬㏝㏐㏓㏃㏉㏜㏆

§ ※ ☆ ★ ○ ● ◎ ◇ ◆ □ ■ △ ▽ → ← ↑ ↓ ↔ 〓 ◁ ◀ ▷ ▶ ♤ ♠ ♡ ♥ ♧ ♣ ⊙ ◈ ▣ ◐ ◑ ▒ ▤ ▥ ▨ ▧ ▦ ▩ ♨ ☏ ☎ ☜ ☞ ¶ † ‡ ↕ ↗ ↙ ↖ ↘ ♭ ♩ ♪ ♬ ㉿㈜ № ㏇ ™ ㏂㏘ ℡ ＃＆＊＠ ª º

ⅰ ⅱ ⅲ ⅳ ⅴ ⅵ ⅶ ⅷ ⅸ ⅹ Ⅰ Ⅱ Ⅲ Ⅳ Ⅴ Ⅵ Ⅶ Ⅷ Ⅸ Ⅹ

ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع غ ف ق ک ل م ن ه و ی

최근 검색 목록
전체삭제 닫기

RISS 인기검색어

딥러닝 기반 한국 고전한문 표점 추론 자동화 모델의 구축과 활용 = Development and Application of a Deep Learning–Based Model for Automated Punctuation Inference in Korean Classical Chinese

한글로보기

https://www.riss.kr/link?id=A110055995

저자

양정현 (국립순천대학교 지리산권문화연구원)
발행기관
호남사학회(Chonnam Historical Association)
학술지명
역사학연구(Chonnam Historical Review)
권호사항

Vol.100 No.- [2025]
발행연도
2025
작성언어
Korean
주제어

디지털 인문학 ; 딥러닝 ; 한국 고전한문 ; 표점 추론 ; 오픈소스 표점 지정 시 스템 ; Digital humanities ; Deep learning ; Korean Classical Chinese ; Punctuation inference ; Open-source punctuation system
등재정보
KCI등재
자료형태
학술저널
수록면

267-297(31쪽)
DOI식별코드
http://dx.doi.org/10.37924/JSSW.100.9
제공처
KISS

0
상세조회
0
다운로드
0
내보내기

서지정보 열기

부가정보

다국어 초록 (Multilingual Abstract)

This study compiles and refines collated and punctuated Classical Chinese texts accumulated through prior research and projects to construct a database of approximately 3.4 million items (≈420 million characters). Building on this resource, we develop a punctuation inference model specialized for Korean Classical Chinese by fine-tuning the pretrained deep learning language model Chinese-RoBERTa into a multi-label token classification architecture. The training corpus—covering eight genres including annals, collected works, and diaries—was preprocessed and standardized to seven punctuation marks (, 。 · ? ! 《》). The final model achieves an overall F1 score of 0.9050 on held-out validation data. On unseen corpora containing only traditional ring-dot punctuation (Hanguk Munjip Chonggan and Ilseongnok), the model attains F1 scores of 0.8784 and 0.9065, respectively, for punctuation-position matching. By punctuation type, question marks, commas, periods, and middle dots exhibit high performance, whereas book-title brackets (《》)— which require long-range dependencies in paired structures—and exclamation marks—sparse in the data—show lower recall. We release an open-source integrated system—including model weights, training data, source code, and GUI/CLI batch processing—to support records and information services and research workflows using natural-language analysis, such as text preprocessing, indexing and search, translation preprocessing, and OCR postprocessing. Future work includes a dual-path architecture for paired punctuation, genre-adaptive modules, and multi-task integration with sentence-structure analysis and named-entity recognition.

번역하기

This study compiles and refines collated and punctuated Classical Chinese texts accumulated through prior research and projects to construct a database of approximately 3.4 million items (≈420 million characters). Building on this resource, we devel...

This study compiles and refines collated and punctuated Classical Chinese texts accumulated through prior research and projects to construct a database of approximately 3.4 million items (≈420 million characters). Building on this resource, we develop a punctuation inference model specialized for Korean Classical Chinese by fine-tuning the pretrained deep learning language model Chinese-RoBERTa into a multi-label token classification architecture. The training corpus—covering eight genres including annals, collected works, and diaries—was preprocessed and standardized to seven punctuation marks (, 。 · ? ! 《》). The final model achieves an overall F1 score of 0.9050 on held-out validation data. On unseen corpora containing only traditional ring-dot punctuation (Hanguk Munjip Chonggan and Ilseongnok), the model attains F1 scores of 0.8784 and 0.9065, respectively, for punctuation-position matching. By punctuation type, question marks, commas, periods, and middle dots exhibit high performance, whereas book-title brackets (《》)— which require long-range dependencies in paired structures—and exclamation marks—sparse in the data—show lower recall. We release an open-source integrated system—including model weights, training data, source code, and GUI/CLI batch processing—to support records and information services and research workflows using natural-language analysis, such as text preprocessing, indexing and search, translation preprocessing, and OCR postprocessing. Future work includes a dual-path architecture for paired punctuation, genre-adaptive modules, and multi-task integration with sentence-structure analysis and named-entity recognition.

더보기

동일학술지(권/호) 다른 논문

진도 벽파진해전 전적지의 국가유산 가치와 활용 고찰
- 호남사학회
- 이수경
- 2025
- KCI등재
나말려초 高僧碑文의 후백제·후고구려 관련 기록 검토
- 호남사학회
- 박수정
- 2025
- KCI등재
정유재란 시기의 명량대첩에서 碧波津의 역할
- 호남사학회
- 김강식
- 2025
- KCI등재
조선후기 南桃鎭 만호의 관인적 위상과 역할 변화
- 호남사학회
- 나영훈
- 2025
- KCI등재

동일학술지 더보기

더보기

분석정보

View

상세정보조회

0

Usage

원문다운로드

0

대출신청

0

복사신청

0

EDDS신청

0

동일 주제 내 활용도 TOP

주제

연도별 연구동향

연도별 활용동향

연관논문

연구자 네트워크맵

공동연구자 (7)

더보기

유사연구자 (20) 활용도상위20명

더보기

연관 공개강의(KOCW)

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료

서지정보
부가정보
동일학술지(권/호) 다른 논문
분석정보
연관 공개강의(KOCW)

해외이동버튼