RISS 학술연구정보서비스

검색
다국어 입력

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)
  • 中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
  • 北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.
닫기
    인기검색어 순위 펼치기

    RISS 인기검색어

      검색결과 좁혀 보기

      선택해제

      오늘 본 자료

      • 오늘 본 자료가 없습니다.
      더보기
      • 무료
      • 기관 내 무료
      • 유료
      • Bayesian Survival Analysis of High-Dimensional Microarray Data for Mantle Cell Lymphoma Patients

        Moslemi, Azam,Mahjub, Hossein,Saidijam, Massoud,Poorolajal, Jalal,Soltanian, Ali Reza Asian Pacific Journal of Cancer Prevention 2016 Asian Pacific journal of cancer prevention Vol.17 No.1

        Background: Survival time of lymphoma patients can be estimated with the help of microarray technology. In this study, with the use of iterative Bayesian Model Averaging (BMA) method, survival time of Mantle Cell Lymphoma patients (MCL) was estimated and in reference to the findings, patients were divided into two high-risk and low-risk groups. Materials and Methods: In this study, gene expression data of MCL patients were used in order to select a subset of genes for survival analysis with microarray data, using the iterative BMA method. To evaluate the performance of the method, patients were divided into high-risk and low-risk based on their scores. Performance prediction was investigated using the log-rank test. The bioconductor package "iterativeBMAsurv" was applied with R statistical software for classification and survival analysis. Results: In this study, 25 genes associated with survival for MCL patients were identified across 132 selected models. The maximum likelihood estimate coefficients of the selected genes and the posterior probabilities of the selected models were obtained from training data. Using this method, patients could be separated into high-risk and low-risk groups with high significance (p<0.001). Conclusions: The iterative BMA algorithm has high precision and ability for survival analysis. This method is capable of identifying a few predictive variables associated with survival, among many variables in a set of microarray data. Therefore, it can be used as a low-cost diagnostic tool in clinical research.

      • KCI등재

        Real-Data Comparison of Data Mining Methods in Prediction of Diabetes in Iran

        Lily Tapak,Hossein Mahjub,Omid Hamidi,Jalal Poorolajal 대한의료정보학회 2013 Healthcare Informatics Research Vol.19 No.3

        Objectives: Diabetes is one of the most common non-communicable diseases in developing countries. Early screening and diagnosis play an important role in effective prevention strategies. This study compared two traditional classification methods (logistic regression and Fisher linear discriminant analysis) and four machine-learning classifiers (neural networks, support vector machines, fuzzy c-mean, and random forests) to classify persons with and without diabetes. Methods: The data set used in this study included 6,500 subjects from the Iranian national non-communicable diseases risk factors surveillance obtained through a cross-sectional survey. The obtained sample was based on cluster sampling of the Iran population which was conducted in 2005–2009 to assess the prevalence of major non-communicable disease risk factors. Ten risk factors that are commonly associated with diabetes were selected to compare the performance of six classifiers in terms of sensitivity, specificity, total accuracy, and area under the receiver operating characteristic (ROC) curve criteria. Results: Support vector machines showed the highest total accuracy (0.986) as well as area under the ROC (0.979). Also, this method showed high specificity (1.000) and sensitivity (0.820). All other methods produced total accuracy of more than 85%, but for all methods, the sensitivity values were very low (less than 0.350). Conclusions: The results of this study indicate that, in terms of sensitivity, specificity, and overall classification accuracy, the support vector machine model ranks first among all the classifiers tested in the prediction of diabetes. Therefore, this approach is a promising classifier for predicting diabetes, and it should be further investigated for the prediction of other diseases.

      • KCI등재

        Predicting 5-Year Survival Status of Patients with Breast Cancer based on Supervised Wavelet Method

        Maryam Farhadian,Hossein Mahjub,Jalal Poorolajal,Abbas Moghimbeigi,Muharram Mansoorizadeh 질병관리본부 2014 Osong Public Health and Research Persptectives Vol.5 No.6

        Objectives: Classification of breast cancer patients into different risk classes is very important in clinical applications. It is estimated that the advent of highdimensional gene expression data could improve patient classification. In this study, a new method for transforming the high-dimensional gene expression data in a low-dimensional space based on wavelet transform (WT) is presented. Methods: The proposed method was applied to three publicly available microarray data sets. After dimensionality reduction using supervised wavelet, a predictive support vector machine (SVM) model was built upon the reduced dimensional space. In addition, the proposed method was compared with the supervised principal component analysis (PCA). Results: The performance of supervised wavelet and supervised PCA based on selected genes were better than the signature genes identified in the other studies. Furthermore, the supervised wavelet method generally performed better than the supervised PCA for predicting the 5-year survival status of patients with breast cancer based on microarray data. In addition, the proposed method had a relatively acceptable performance compared with the other studies. Conclusion: The results suggest the possibility of developing a new tool using wavelets for the dimension reduction of microarray data sets in the classification framework.

      • KCI등재

        Factors associated with mortality from tuberculosis in Iran: an application of a generalized estimating equation-based zero-inflated negative binomial model to national registry data

        Fatemeh Sarvi,Abbas Moghimbeigi,Hossein Mahjub,Mahshid Nasehi,Mahmoud Khodadost 한국역학회 2019 Epidemiology and Health Vol.41 No.-

        OBJECTIVES: Tuberculosis (TB) is a global public health problem that causes morbidity and mortality in millions of people per year. The purpose of this study was to examine the relationship of potential risk factors with TB mortality in Iran. METHODS: This cross-sectional study was performed on 9,151 patients with TB from March 2017 to March 2018 in Iran. Data were gathered from all 429 counties of Iran by the Ministry of Health and Medical Education and Statistical Center of Iran. In this study, a generalized estimating equation-based zero-inflated negative binomial model was used to determine the effect of related factors on TB mortality at the community level. For data analysis, R version 3.4.2 was used with the relevant packages. RESULTS: The risk of mortality from TB was found to increase with the unemployment rate (βˆ=0.02), illiteracy (βˆ=0.04), household density per residential unit (βˆ=1.29), distance between the center of the county and the provincial capital (βˆ=0.03), and urbanization (βˆ=0.81). The following other risk factors for TB mortality were identified: diabetes (βˆ=0.02), human immunodeficiency virus infection (βˆ=0.04), infection with TB in the most recent 2 years (βˆ=0.07), injection drug use (βˆ=0.07), long-term corticosteroid use (βˆ=0.09), malignant diseases (βˆ=0.09), chronic kidney disease (βˆ=0.32), gastrectomy (βˆ=0.50), chronic malnutrition (βˆ=0.38), and a body mass index more than 10% under the ideal weight (βˆ=0.01). However, silicosis had no effect. CONCLUSIONS: The results of this study provide useful information on risk factors for mortality from TB.

      • SCOPUSKCI등재

        Estimation of the Frequency of Intravenous Drug Users in Hamadan City, Iran, Using the Capture-recapture Method

        Salman Khazaei,Jalal Poorolajal,Hossein Mahjub,Nader Esmailnasab,Mohammad Mirzaei 한국역학회 2012 Epidemiology and Health Vol.34 No.-

        OBJECTIVES: The number of illicit drug users is prone to underestimation. This study aimed to use the capture-recapture method as a statistical procedure for measuring the prevalence of intravenous drug users (IDUs) by estimating the number of unknown IDUs not registered by any of the registry centers. METHODS: This study was conducted in Hamadan City, the west of Iran, in 2012. Three incomplete data sources of IDUs, with partial overlapping data, were assessed including: (a) Volunteer Counseling and Testing Centers(VCTCs); (b) Drop in Centers (DICs); and (c) Outreach Teams (ORTs). A log-linear model was applied for the analysis of three-sample capture-recapture results. Two information criteria were used for model selection including Akaike’s Information Criterion and the Bayesian Information Criterion. RESULTS: Out of 1,478 IDUs registered by three centers, 48% were identified by VCTCs, 32% by DICs, and 20% by ORTs. After exclusion of duplicates, 1,369 IDUs remained. According to our findings, there were 9,964 (95% CI, 6,088 to 17,636) IDUs not identified by any of the centers. Hence, the real number of IDUs is expected to be 11,333. Based on these findings, the overall completeness of the three data sources was around 12% (95% CI, 7% to 18%). CONCLUSION: There was a considerable number of IDUs not identified by any of the centers. Although the capture-recapture method is a useful and practical approach for estimating unknown populations, due to the assumptions and limitations of the method, the results must be interpreted with caution.

      • SCOPUSKCI등재

        Survival Analysis of Gastric Cancer Patients with Incomplete Data

        Moghimbeigi, Abbas,Tapak, Lily,Roshanaei, Ghodaratolla,Mahjub, Hossein The Korean Gastric Cancer Association 2014 Journal of gastric cancer Vol.14 No.4

        Purpose: Survival analysis of gastric cancer patients requires knowledge about factors that affect survival time. This paper attempted to analyze the survival of patients with incomplete registered data by using imputation methods. Materials and Methods: Three missing data imputation methods, including regression, expectation maximization algorithm, and multiple imputation (MI) using Monte Carlo Markov Chain methods, were applied to the data of cancer patients referred to the cancer institute at Imam Khomeini Hospital in Tehran in 2003 to 2008. The data included demographic variables, survival times, and censored variable of 471 patients with gastric cancer. After using imputation methods to account for missing covariate data, the data were analyzed using a Cox regression model and the results were compared. Results: The mean patient survival time after diagnosis was $49.1{\pm}4.4$ months. In the complete case analysis, which used information from 100 of the 471 patients, very wide and uninformative confidence intervals were obtained for the chemotherapy and surgery hazard ratios (HRs). However, after imputation, the maximum confidence interval widths for the chemotherapy and surgery HRs were 8.470 and 0.806, respectively. The minimum width corresponded with MI. Furthermore, the minimum Bayesian and Akaike information criteria values correlated with MI (-821.236 and -827.866, respectively). Conclusions: Missing value imputation increased the estimate precision and accuracy. In addition, MI yielded better results when compared with the expectation maximization algorithm and regression simple imputation methods.

      • Diabetic peripheral neuropathy class prediction by multicategory support vector machine model: a cross-sectional study

        Maryam Kazemi,Abbas Moghimbeigi,Javad Kiani,Hossein Mahjub,Javad Faradmal 한국역학회 2016 Epidemiology and Health Vol.38 No.-

        OBJECTIVES: Diabetes is increasing in worldwide prevalence, toward epidemic levels. Diabetic neuropathy, one of the most common complications of diabetes mellitus, is a serious condition that can lead to amputation. This study used a multicategory support vector machine (MSVM) to predict diabetic peripheral neuropathy severity classified into four categories using patients’ demographic characteristics and clinical features. METHODS: In this study, the data were collected at the Diabetes Center of Hamadan in Iran. Patients were enrolled by the convenience sampling method. Six hundred patients were recruited. After obtaining informed consent, a questionnaire collecting general information and a neuropathy disability score (NDS) questionnaire were administered. The NDS was used to classify the severity of the disease. We used MSVM with both one-against-all and one-against-one methods and three kernel functions, radial basis function (RBF), linear, and polynomial, to predict the class of disease with an unbalanced dataset. The synthetic minority class oversampling technique algorithm was used to improve model performance. To compare the performance of the models, the mean of accuracy was used. RESULTS: For predicting diabetic neuropathy, a classifier built from a balanced dataset and the RBF kernel function with a one-against-one strategy predicted the class to which a patient belonged with about 76% accuracy. CONCLUSIONS: The results of this study indicate that, in terms of overall classification accuracy, the MSVM model based on a balanced dataset can be useful for predicting the severity of diabetic neuropathy, and it should be further investigated for the prediction of other diseases.

      • KCI등재

        Survival Analysis of Gastric Cancer Patients with Incomplete Data

        Abbas Moghimbeigi,Lily Tapak,Ghodaratolla Roshanaei,Hossein Mahjub 대한위암학회 2014 Journal of gastric cancer Vol.14 No.4

        Purpose: Survival analysis of gastric cancer patients requires knowledge about factors that affect survival time. This paper attempted to analyze the survival of patients with incomplete registered data by using imputation methods. Materials and Methods: Three missing data imputation methods, including regression, expectation maximization algorithm, and multiple imputation (MI) using Monte Carlo Markov Chain methods, were applied to the data of cancer patients referred to the cancer institute at Imam Khomeini Hospital in Tehran in 2003 to 2008. The data included demographic variables, survival times, and censored variable of 471 patients with gastric cancer. After using imputation methods to account for missing covariate data, the data were analyzed using a Cox regression model and the results were compared. Results: The mean patient survival time after diagnosis was 49.1±4.4 months. In the complete case analysis, which used information from 100 of the 471 patients, very wide and uninformative confidence intervals were obtained for the chemotherapy and surgery hazard ratios (HRs). However, after imputation, the maximum confidence interval widths for the chemotherapy and surgery HRs were 8.470 and 0.806, respectively. The minimum width corresponded with MI. Furthermore, the minimum Bayesian and Akaike information criteria values correlated with MI (−821.236 and −827.866, respectively). Conclusions: Missing value imputation increased the estimate precision and accuracy. In addition, MI yielded better results when compared with the expectation maximization algorithm and regression simple imputation methods.

      • Estimation of Survival Rates in Patients with Lung Cancer in West Azerbaijan, the Northwest of Iran

        Abazari, Malek,Gholamnejad, Mahdia,Roshanaei, Ghodratollah,Abazari, Reza,Roosta, Yousef,Mahjub, Hossein Asian Pacific Journal of Cancer Prevention 2015 Asian Pacific journal of cancer prevention Vol.16 No.9

        Background: Lung cancer is a fatal malignancy with high mortality and short survival time. The aim of this study was to estimate survival rates of Iranian patients with lung cancer and its associate predictive factors. Materials and Methods: The study was conducted on 355 patients admitted to hospitals of West Azerbaijan in the year 2007. The patients were followed up by phone calls until the end of June 2014. The survival rate was estimated using the Kaplan-Meier method and log-rank test for comparison. The Cox's proportional hazard model was used to investigate the effect of various variables on patient survival time, including age, sex, Eastern Cooperative Oncology Group (ECOG) performance, smoking status, tumor type, tumor stage, treatment, metastasis, and blood hemoglobin concentration. Results: Of the 355 patients under study, 240 died and 115 were censored. The mean and median survival time of patients was 13 and 4.8 months, respectively. According to the results of Kaplan-Meier method, 1, 2, and 3 years survival rates were 39%, 18%, and 0.07%, respectively. Based on Cox regression analysis, the risk of death was associated with ECOG group V (1.83, 95% CI: 1 Conclusions: The survival time of the patients with lung cancer is very short. While early diagnosis may improve the life expectancy effective treatment is not available.

      • KCI등재

        Predicting Hospital Readmission in Heart Failure Patients in Iran: A Comparison of Various Machine Learning Methods

        Roya Najafi-Vosough,Javad Faradmal,Seyed Kianoosh Hosseini,Abbas Moghimbeigi,Hossein Mahjub 대한의료정보학회 2021 Healthcare Informatics Research Vol.27 No.4

        Objectives: Heart failure (HF) is a common disease with a high hospital readmission rate. This study considered class imbalanceand missing data, which are two common issues in medical data. The current study’s main goal was to compare theperformance of six machine learning (ML) methods for predicting hospital readmission in HF patients. Methods: In thisretrospective cohort study, information of 1,856 HF patients was analyzed. These patients were hospitalized in FarshchianHeart Center in Hamadan Province in Western Iran, from October 2015 to July 2019. The support vector machine (SVM),least-square SVM (LS-SVM), bagging, random forest (RF), AdaBoost, and naïve Bayes (NB) methods were used to predicthospital readmission. These methods’ performance was evaluated using sensitivity, specificity, positive predictive value, negativepredictive value, and accuracy. Two imputation methods were also used to deal with missing data. Results: Of the 1,856HF patients, 29.9% had at least one hospital readmission. Among the ML methods, LS-SVM performed the worst, with accuracyin the range of 0.57–0.60, while RF performed the best, with the highest accuracy (range, 0.90–0.91). Other ML methodsshowed relatively good performance, with accuracy exceeding 0.84 in the test datasets. Furthermore, the performance ofthe SVM and LS-SVM methods in terms of accuracy was higher with the multiple imputation method than with the medianimputation method. Conclusions: This study showed that RF performed better, in terms of accuracy, than other methods forpredicting hospital readmission in HF patients.

      연관 검색어 추천

      이 검색어로 많이 본 자료

      활용도 높은 자료

      해외이동버튼