RISS 학술연구정보서비스

검색
다국어 입력

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)
  • 中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
  • 北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.
닫기
    인기검색어 순위 펼치기

    RISS 인기검색어

      검색결과 좁혀 보기

      선택해제
      • 좁혀본 항목 보기순서

        • 원문유무
        • 원문제공처
          펼치기
        • 등재정보
          펼치기
        • 학술지명
          펼치기
        • 주제분류
          펼치기
        • 발행연도
          펼치기
        • 작성언어
        • 저자
          펼치기

      오늘 본 자료

      • 오늘 본 자료가 없습니다.
      더보기
      • 무료
      • 기관 내 무료
      • 유료
      • KCI등재

        Performance Evaluation of a Feature-Importance-based Feature Selection Method for Time Series Prediction

        안현 한국정보통신학회 2023 Journal of information and communication convergen Vol.21 No.1

        Various machine-learning models may yield high predictive power for massive time series for time series prediction. However, these models are prone to instability in terms of computational cost because of the high dimensionality of the feature space and nonoptimized hyperparameter settings. Considering the potential risk that model training with a high-dimensional feature set can be time-consuming, we evaluate a feature-importance-based feature selection method to derive a tradeoff between predictive power and computational cost for time series prediction. We used two machine learning techniques for performance evaluation to generate prediction models from a retail sales dataset. First, we ranked the features using impurity- and Local Interpretable Model-agnostic Explanations (LIME) -based feature importance measures in the prediction models. Then, the recursive feature elimination method was applied to eliminate unimportant features sequentially. Consequently, we obtained a subset of features that could lead to reduced model training time while preserving acceptable model performance.

      • KCI등재

        Stacked Autoencoder 기반 악성코드 Feature 정제 기술 연구

        김홍비(Hong-bi Kim),이태진(Tae-jin Lee) 한국정보보호학회 2020 정보보호학회논문지 Vol.30 No.4

        네트워크의 발전에 따라 악성코드 생성도구가 유포되는 등으로 인해 악성코드의 출현이 기하급수적으로 증가하였으나 기존의 악성코드 탐지 방법을 통한 대응에는 한계가 존재한다. 이러한 상황에 따라 머신러닝 기반의 악성 코드 탐지 방법이 발전하는 추세이며, 본 논문에서는 머신러닝 기반의 악성 코드 탐지를 위해 PE 헤더에서 데이터의 feature를 추출한 후 이를 이용하여 autoencoder를 통해 악성코드를 더 잘 나타내는 feature 및 feature importance를 추출하는 방법에 대한 연구를 진행한다. 본 논문은 악성코드 분석에서 범용적으로 사용되는 PE 파일에서 확인 가능한 DLL/API 등의 정보로 구성된 549개의 feature를 추출하였고 머신러닝의 악성코드 탐지 성능 향상을 위해 추출된 feature를 이용하여 autoencoder를 통해 데이터를 압축적으로 저장함으로써 데이터의 feature를 효과적으로 추출해 우수한 정확도 제공 및 처리 시간을 2배 단축에 성공적임을 증명하였다. 시험 결과는 악성코드 그룹 분류에도 유용함을 보였으며, 향후 SVM과 같은 분류기를 도입하여 더욱 정확한 악성코드 탐지를 위한 연구를 이어갈 예정이다. The advent of malicious code has increased exponentially due to the spread of malicious code generation tools in accordance with the development of the network, but there is a limit to the response through existing malicious code detection methods. According to this situation, a machine learning-based malicious code detection method is evolving, and in this paper, the feature of data is extracted from the PE header for machine-learning-based malicious code detection, and then it is used to automate the malware through autoencoder. Research on how to extract the indicated features and feature importance. In this paper, 549 features composed of information such as DLL/API that can be identified from PE files that are commonly used in malware analysis are extracted, and autoencoder is used through the extracted features to improve the performance of malware detection in machine learning. It was proved to be successful in providing excellent accuracy and reducing the processing time by 2 times by effectively extracting the features of the data by compressively storing the data. The test results have been shown to be useful for classifying malware groups, and in the future, a classifier such as SVM will be introduced to continue research for more accurate malware detection.

      • KCI등재

        Quantitative analysis of pulse arrival time and PPG morphological features based cuffless blood pressure estimation: a comparative study between diabetic and non‑diabetic groups

        Seongryul Park,Seungjae Lee,Eunkyoung Park,Jongshill Lee,In Young Kim 대한의용생체공학회 2023 Biomedical Engineering Letters (BMEL) Vol.13 No.4

        Pulse arrival time (PAT) and PPG morphological features have attracted much interest in cuffless blood pressure (BP) estimation,but their effects are not clearly understood when vascular characteristics are affected by diseases such as diabetes. This work quantitatively analyzes the effect of diabetic disease on the PAT and PPG morphological features-based BP estimation. We selected 112 diabetic patients and 308 non-diabetic subjects from VitalDB, and extracted 16 features includingPAT, PPG morphological features, and heart rate. BP estimation performance was statistically compared between groupsusing linear regression models with several feature sets, and the relative importance of each feature in the optimal featureset was extracted. As a result, the standard deviation of the error and mean absolute error of PAT-based BP estimation weresignificantly higher in the diabetic group than in the non-diabetic group (p < 0.01). A feature set containing PAT and PPGmorphological features achieved the best performance in both groups. However, the relative importance of each feature forBP estimation differed notably between groups. The results indicate that different features are important depending on thevascular characteristics, which could help to construct different models to accommodate specific diseases.

      • KCI우수등재

        빅데이터 분석기법을 이용한 중소기업 성장 예측 모델 연구

        모혜란,김현경,김 현 대한전자공학회 2023 전자공학회논문지 Vol.60 No.3

        Through this paper, we would like to introduce a predictive model for the future growth potential of SMEs based on corporate big data analysis. In particular, financial data is the most important variable related to corporate growth. In previous studies, financial status is frequently used to predict corporate growth potential. However, in this paper, the company's financial data and the company's stock price are used as major variables to predict the company's growth potential. Based on Feature Importance, major variables related to corporate growth were selected. It was confirmed that the company's financial position and stock price are related to each other using the K-Means algorithm. This is because various indicators such as the possibility of a company's entry/expansion into the market, technological advantage/discrimination and expertise, management ability, growth, profitability, and stability are reflected in the company's stock price. In this paper, we were able to propose a model that can predict a company's growth potential using PCA and Feature Importance. 우리는 본 논문을 통해 기업의 빅데이터 분석을 바탕으로 중소기업의 미래 성장 가능성에 대한 예측 모델 소개하고자 한다. 특히 재무 데이터는 기업의 성장과 관련된 가장 중요한 변수인데, 기존 연구들에서 기업 성장 가능성 예측에 빈번하게 사용되고 있다. 그러나 본 논문에서는 기업의 재무 데이터와 기업의 주가가 기업 성장 가능성을 예측하는 주요 변수로 활용하였는데, Feature Importance를 기반으로 기업 성장과 관련 있는 주요 변수들을 선택하고 이는 K-Means 알고리즘을 활용해 기업의 재무상태와 주가가 서로 연관이 있음을 확인하였다. 특히 기업의 시장진입/확대가능성, 기술우위/차별성 및 전문성, 경영능력, 성장성, 수익성, 안정성 등의 다양한 지표들이 주가에 반영되기 때문이다. 우리는 본 논문을 통해 PCA와 Feature Importance를 이용해 기업의 성장 가능성을 예측할 수 있는 모델을 제안할 수 있었다.

      • KCI등재

        Research on the Sequence Planning of Manufacturing Feature Based on the Node Importance of Complex Network

        Bin Cheng,Dingjie Guan,Bingxue Jing 한국정밀공학회 2022 International Journal of Precision Engineering and Vol.23 No.2

        Small and medium-sized manufacturing enterprises involve a lot of customized products. The degree of adaptability should be noted while improving product design and manufacturing digital and intelligent levels. This paper presents a process sequencing method of manufacturing features based on the node importance of a complex network. The method is based on the adjacency matrix and connected graph to analyze the process constraint semantics of the product model. The adjacency matrix expresses the positioning dimensions between features. The connected graph is applied to define the constraint relationships between features and aggregate the multi-dimensional process dimension chain in all directions. Based on the processing sequence of node importance in a complex network, most of process planning can be realized. The method also can make adaptive decisions for different structural parts and monitor the machining of key features. Examples verify the validity and feasibility of the proposed method.

      • KCI등재

        머신러닝 분류 알고리즘을 활용한 선박 접안속도 영향요소의 중요도 분석

        이형탁,이상원,조장원,조익순 해양환경안전학회 2020 해양환경안전학회지 Vol.26 No.2

        The most important factor affecting the berthing energy generated when a ship berths is the berthing velocity. Thus, an accident may occur if the berthing velocity is extremely high. Several ship features influence the determination of the berthing velocity. However, previous studies have mostly focused on the size of the vessel. Therefore, the aim of this study is to analyze various features that influence berthing velocity and determine their respective importance. The data used in the analysis was based on the berthing velocity of a ship on a jetty in Korea. Using the collected data, machine learning classification algorithms were compared and analyzed, such as decision tree, random forest, logistic regression, and perceptron. As an algorithm evaluation method, indexes according to the confusion matrix were used. Consequently, perceptron demonstrated the best performance, and the feature importance was in the following order: DWT , jetty number, and state. Hence, when berthing a ship, the berthing velocity should be determined in consideration of various features, such as the size of the ship, position of the jetty, and loading condition of the cargo. 선박이 접안할 때 발생하는 접안에너지에 가장 영향력이 큰 요소는 접안속도이며, 과도한 경우 사고로 이어질 수 있다. 접안속도의 결정에 영향을 미치는 요소는 다양하지만 기존 연구에서는 일반적으로 선박 크기에 제한하여 분석하였다. 따라서 본 연구에서는 다양한 선박 접안속도의 영향요소를 반영하여 분석하고 그에 따른 중요도를 도출하고자 한다. 분석에 활용한 데이터는 국내 한 탱커부두의 선박 접안속도를 실측한 것을 바탕으로 하였다. 수집된 데이터를 활용하여 머신러닝 분류 알고리즘인 의사결정나무(Decision Tree), 랜덤포레스트(Random Forest), 로지스틱회귀(Logistic Regression), 퍼셉트론(Perceptron)을 비교분석하였다. 알고리즘 평가 방법으로는 혼동 행렬에 따른 모델성능 평가지표를 사용하였다. 분석 결과, 가장 성능이 좋은 알고리즘으로는 퍼셉트론이 채택되었으며 그에 따른 접안속도 영향요인의 중요도는 선박 크기(DWT), 부두 위치(Jetty No.), 재화상태(State) 순으로 나타났다. 이에 따라 선박 접안 시, 선박의 크기를 비롯하여 부두 위치, 재화 상태 등 다양한 요인을 고려하여 접안속도를 설계하여야 한다.

      • KCI등재

        Machine learning-based prediction of Sasang constitution types using comprehensive clinical information and identification of key features for diagnosis

        박사윤,Musun Park,Won-Yung Lee,Choong-Yeol Lee,Ji-Hwan Kim,이시우,김창업 한국한의학연구원 2021 Integrative Medicine Research Vol.10 No.3

        Background: Despite the importance of accurate Sasang type diagnosis, a unique form of Korean medicine, there have been concerns about consistency among diagnoses. We investigate a data-driven integrative diagnostic model by applying machine learning to a multicenter clinical dataset with comprehensive features. Methods: Extremely randomized trees (ERT), support vector machines, multinomial logistic regression, and K-nearest neighbor were applied, and performances were evaluated by cross-validation. The feature importance of the classifier was analyzed to understand which information is crucial in diagnosis. Results: The ERT classifier showed the highest performance, with an overall f1 score of 0.60 ± 0.060. The feature classes of body measurement, personality, general information, and cold–heat were more decisive than others in classifying Sasang types. Costal angle was the most informative feature. In pairwise classification, we found Sasang type-dependent distinctions that body measurement features played a key role in TE-SE and TE-SY datasets, while personality and cold–heat features showed importance in SE-SY dataset. Conclusion: Current study investigated a comprehensive diagnostic model for Sasang type using machine learning and achieved better performance than previous studies. This study helps data-driven decision making in clinics by revealing key features contributing to the Sasang type diagnosis.

      • KCI우수등재

        A Quantitative Analysis of the Little Red Riding Hood Types and Story Element-Function-Plot Relations

        박정식,한호 한국영어학회 2022 영어학 Vol.22 No.-

        Tale types of “Little Red Riding Hood” have survived through oral transmission in various areas including Europe, Africa, and Asia and can even be traced back to 10th century in a written form. This research presents quantitative analyses on the folkloric landscape of tales of, or related to, what is best known as Little Red Riding Hood through the Aarne-Thompson-Uther (ATU) index, of which we analyzed ATU 333, ATU 123, and other unspecified types, based on logistic regression and decision tree. The quantitative analyses of the Little Red Riding Hood tale types indicate that ATU 123 alone has the specific story segments that are important to the formation of the tale type and that though diversified in story segments and other details, the three types shared the distinct plot sequence as an important feature. In addition, eight event descriptors and six character and setting descriptors are found to be meaningful factors in the formation of ATU 123. It can be further argued that the plot as an abstraction played a major role in the formation of the tales we have now. Also demonstrated in this paper is that researchers can yield substantial insights into the quantitative results while cross-checking them with qualitative analyses.

      • 서울시 열섬현상 완화를 위한 바람길 및 녹지 입지 선정 제안 : 클러스터링 및 랜덤포레스트 변수 중요도를 활용한 열섬척도 제안을 중심으로

        간정현,남승희,서윤지,전은지,이광춘 한국디지털융합학회 2019 디지털경영연구 Vol.6 No.1

        As the climate problem becomes more serious, mitigating the heat island effect is becoming an important issue. To solve the problem, it is necessary to identify the areas where the heat island effect is more severe and present appropriate measures to those areas. In this paper, we divided Seoul into 573 grids and divided into four clusters by hierarchical clustering techniques to identify the current status of the city's heat island effect. Of these cluster, we chose the one with the most urban features. We extracted the variable importance with random forest models and developed the heat island index from 1) multiplication of the variable importance and 2) the sign of the correlation coefficient between temperature and 3) the corresponding variable. Furthermore we analyzed areas where the heat island effect is severe with the index. According to the analysis, the heat island effect was particularly severe in five grids, and we suggested ways to mitigate the heat island effect for the areas concerned.

      • KCI등재

        중·고령자 자살생각 예측모델 개발 및 요인분석 -머신러닝과 전통적 통계기법 혼합사례연구-

        정현우,장진수 사단법인 대한보건협회 2024 대한보건연구 Vol.50 No.1

        -Purpose:This study aims to develop a predictive model and identify crucial variables associated with suicidal ideation, employing factor analysis. -Methods:Distinct models were created for both the overall and older adult population. Six machine learning algorithms were applied to construct predictive models and assess feature importance. Traditional logistic regression analysis was conducted for factor analysis. -Results:Gradient Boosting and SVM stood out, highlighting anxiety and depression as pivotal variables. For the older adults, anxiety & depression, future anxiety, and being a care recipient were crucial features. Logistic regression analysis indicated the significance of mental and physical health, along with residential factors. -Conclusion:The machine learning results closely aligned with outcomes from traditional statistical models, showcasing generally reliable findings. Suicidal ideation appears linked to situations that are challenging to overcome through individual efforts alone. This study emphasizes the necessity for cultural and policy programs fostering inclusivity throughout South Korea.

      연관 검색어 추천

      이 검색어로 많이 본 자료

      활용도 높은 자료

      해외이동버튼