Approaching Rules Induction CN2 Algorithm in Categorizing of Biodiversity
Rule induction is an area of machine learning in which formal rules are extracted from a set of observations. Machine learning is a field of artificial intelligence that uses statistical techniques to give computer systems the ability to learn from data, without being explicitly programmed. Machine learning applications are classification, regression, clustering, density estimation and dimensionality reduction. The CN2 algorithm is a classification technique designed for the efficient induction of simple, comprehensible rules of form “if cond then predict class”, even in domains where noise may be present. Biodiversity means biological diversity, the variety of life found in a place on Earth or, often, the total variety of life on Earth. This research used butterflies as biological dataset for categorizing biodiversity and passed it to CN2 Rule Induction. In this research, “The Fauna of British India, Ceylon and Burma. Butterflies. Vol. I and Vol. II” written by C.T Bingham are used as the required knowledge for resource and categorizing biodiversity of butterfly families by rules induction with CN2 algorithm system has developed. In this system, MS Visual Studio as a programming tool and MS SQL Server as for database development are used.
Machine Learning, Rule Induction, CN2 Algorithm, Biodiversity
Su Myo Swe | Khin Myo Sett