1 AIT Asian Institute of Technology

An extended k-means++ with mixed attributes for outlier detection.

AuthorSarunya Kanjanawattana
Call NumberAIT Thesis no.CS-11-14
Subject(s)Accounting fraud
Fraud
Misleading financial statements
Cluster analysis--Data processing

NoteA thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer science, School of Engineering and Technology
PublisherAsian Institute of Technology
Series StatementThesis ; no. CS-11-14
AbstractFraud detection becomes an important task nowadays especially a health insurance. Some developing country such as Thailand, we use human inspection and some heuristic rules to detect fraud on financial statements. That spends a lot of time to success and may provide incorrect results. This is a reason why we need effective fraud detection system to support the auditing of organization process. We proposed_ a new algorithm, called MixKmeans++, which is an extension of K-means++. The new algorithm can be applied to mixed numeric and categorical data. The limitation of K-means and K-means++ is to apply with only numeric data. Though, almost data in real world are combined by categorical and numeric data. This MixK-means++ algorithm can overcome this disadvantage. We compared the speed and performance of clustering between a standard K-means versus MixK-means++, and the results of outlier detection. MixK-means++ provided favor results even it worked with mixed attributes of data set. It was better both speed and performance than K-means. In order to compare the performance in outlier detection system, we can determine that MixK-means++ was also better in term of outlier detection because it provided higher detection rate than K-means even some cases of K-means might offer greater accuracy rate.
Year2011
Corresponding Series Added EntryAsian Institute of Technology. Thesis : no. CS-11-14
TypeThesis
SchoolSchool of Engineering and Technology (SET)
DepartmentDepartment of Information and Communications Technologies (DICT)
Academic Program/FoSComputer Science (CS)
Chairperson(s)Guha, Sumanta;
Examination Committee(s)Phan Minh Dung;Dailey, Matthew N.;
Scholarship Donor(s)Royal Thai Government.;
DegreeThesis (M. Sc.) - Asian Institute of Technology, 2011


Usage Metrics
View Detail0
Read PDF0
Download PDF0