데이터의 공간적인 분포를 이용한 가변 임계값 기반 특징선택
- Author(s)
- 손창식; 신아미; 이영동; 박희준; 박형섭; 김윤년
- Keimyung Author(s)
- Park, Hee Jun; Park, Hyoung Seob; Kim, Yoon Nyun
- Department
- Dept. of Biomedical Engineering (의용공학과)
Dept. of Internal Medicine (내과학)
- Journal Title
- 대한의료정보학회지
- Issued Date
- 2009
- Volume
- 15
- Issue
- 4
- Abstract
- Objective: In processing high dimensional clinical data, choosing the optimal subset of features is important, not only for
reduce the computational complexity but also to improve the value of the model constructed from the given data. This study
proposes an efficient feature selection method with a variable threshold. Methods: In the proposed method, the spatial
distribution of labeled data, which has non-redundant attribute values in the overlapping regions, was used to evaluate the
degree of intra-class separation, and the weighted average of the redundant attribute values were used to select the cut-off
value of each feature. Results: The effectiveness of the proposed method was demonstrated by comparing the experimental
results for the dyspnea patients’ dataset with 11 features selected from 55 features by clinical experts with those obtained
using seven other classification methods. Conclusion: The proposed method can work well for clinical data mining and
pattern classification applications. (Journal of Korean Society of Medical Informatics 15-4, 475-481, 2009)
Key words: Feature Selection, Variable Threshold, Pattern Classification, Dyspnea Patients
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.