Data mining is an English term linked to computer science whose translation is data mining .It consists of a functionality that organizes data in relevant patterns ,associations , changes and anomalies .The term data mining first emerged in 1990 in database communities. Data mining is the process analysis step known as KDD ( Knowledge Discovery in Databases ), its literal translation being “Knowledge Discovery in Databases”.
The data mining can be divided into a few basic steps that are: exploration, construction model, standard definition and validation and verification.
Data mining is a relatively recent practice in the world of computing, and it uses information retrieval, artificial intelligence, pattern recognition, and statistical techniques to search for correlations between different data that enable you to gain beneficial knowledge for a company or individual. For a company, data mining can be an important tool that enhances innovation and profitability.
The use of data mining is quite common in large databases, and the end result of their use can be displayed through rules, hypotheses, decision trees, dendrograms, etc.Well-executed data mining should accomplish such tasks as: anomaly detection, association rule learning (dependency modeling), clustering , classification, regression, and summarization. The data mining process typically occurs using data contained within the data warehouse .
There are several companies and software that are dedicated to data mining, as the identification of patterns in databases is increasingly important. However, the identification of relevant standards is not unique to the computer world. The human brain uses a similar process to identify patterns and acquire knowledge.
In recent years, data mining has been widely used in the areas of science and engineering such as bioinformatics, genetics, medicine, education and electrical engineering.The concept of data mining is often associated with extracting information about people’s behavior. For this reason, in some situations, data mining raises legal issues and issues related to privacy and ethics. Despite this, many people claim that data mining is ethically neutral as it has no ethical implications.
Real Data Mining Examples
Data mining is often used by companies and organizations to gain knowledge about users / employees / customers. For example, in the public sector it is possible to cross-check between the marital status of an employee and the salary he earns to see if this has an influence on his marital life.Companies such as supermarket chains can use this cross-data to determine products that are purchased together. If a customer who buys product X also buys product Y, it may be a good idea to position the two products nearby to make it easier for the customer to buy.