Data seem to be everywhere. With the spread of social networks, smartphones, and other technical systems, information has become the new gold for businesses. That’s why many new techniques and procedures have been developed to search for, collect, clean, and analyze data. Whether you are a layman or a junior data scientist, try these data mining quiz questions to test your knowledge.

## Data Mining Quiz Questions and Answers

### 1. What is the main goal of data mining?

A. To store and distribute data

B. To turn raw data into helpful information

C. To classify data for further tasks

### 2. Width and height belong to which form of data?

A. Continuous data

B. Discrete data

C. Finite data

### 3. Which process finds hidden patterns in unlabeled data?

A. Reinforcement learning

B. Unsupervised learning

C. Supervised learning

### 4. In addition to machine learning and artificial intelligence, which branch of mathematics is the foundation of data mining?

A. Linear algebra

B. Geometry

C. Statistics

### 5. Decision trees are an example of which type of modelling?

A. Descriptive modelling

B. Predictive modelling

C. Prescriptive modelling

### 6. Which of the following is not an application of data mining?

A. Financial fraud detection

B. Spam e-mail filtering

C. Cost minimisation

### 7. Which process takes operational data from different sources and maps it to a new structure in the warehouse?

A. Transformation

B. Integration

C. Combination

### 8. In data mining, which stage involves collecting and preparing data?

A. Exploration

B. Validation

C. Both A and B

### 9. In a decision tree, each node is either a decision node or …

A. A sub node

B. A root node

C. A leaf node

### 10. What is the Naïve Bayes algorithm used for?

A. To estimate the probability of a class value in prediction and classification

B. To generate a mining model

C. Both A and B

### 11. Which of the following algorithms can be used for finding correlations among attributes in data?

A. Series algorithm

B. Association algorithm

C. Associative algorithm

### 12. A supermarket wants to divide its customers into groups with distinctive features. Which process should be used?

A. Unsupervised learning

B. Supervised learning

C. Reinforcement learning

### 13. Which technique can be used to predict the number of newborn babies based on existing data?

A. Clustering

B. Classification

C. Regression

### 14. Which of the following is a basic process of data mining?

A. Infrastructure, exploration, analysis, interpretation, and exploitation

B. Infrastructure, analysis, exploration, exploitation, and interpretation

C. Infrastructure, exploration, analysis, exploitation, and interpretation

### 15. What is a binary attribute?

A. A system that can be used without previous knowledge of the internal operation

B. Variables with only two values

C. A natural environment of some species

### 16. A self-organizing map is a typical example of which technique?

A. Supervised learning

B. Unsupervised learning

C. Reinforcement learning

### 17. What does ETL stand for?

A. Exclude, transport, load

B. Extract, transform, load

C. Extract, transport, lift

### 18. What do we call the process of dividing data into three separate parts: a training set, a validation set, and a testing set?

A. Data exploring

B. Data modelling

C. Data partitioning
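
As a rough illustration of the three-way split that question 18 describes, here is a minimal Python sketch. The function name `three_way_split`, the 60/20/20 proportions, and the fixed seed are assumptions made for this example; they are not prescribed by the quiz or by any particular library.

```python
import random


def three_way_split(rows, train_frac=0.6, val_frac=0.2, seed=0):
    """Shuffle rows, then cut them into training, validation, and testing parts.

    The fractions are illustrative defaults; whatever is left after the
    training and validation parts becomes the testing part.
    """
    if train_frac <= 0 or val_frac < 0 or train_frac + val_frac >= 1:
        raise ValueError("fractions must leave room for a testing part")

    shuffled = list(rows)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)

    n_train = int(len(shuffled) * train_frac)
    n_val = int(len(shuffled) * val_frac)

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


train, val, test = three_way_split(range(10))
print(len(train), len(val), len(test))
```

Shuffling before cutting avoids a split that simply follows the original order of the records, which matters when the data are sorted by date or by class.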

### 19. When was the term “data mining” first used?

A. In the 1980s

B. In the 1990s

C. In the 2000s

### 20. Which of the following is often considered one of the earliest methods of identifying patterns in data?

A. Bayes’s theorem

B. Regression analysis

C. Cluster analysis

### 21. How many phases are there in the Cross-Industry Standard Process for Data Mining (CRISP-DM)?

A. 3

B. 4

C. 6

### 22. In data mining, which technique is most commonly used to discover structures or groups with similar features?

A. Regression

B. Classification

C. Clustering

### 23. Knowledge Discovery in Databases (KDD) refers to …

A. A collection of useful and interesting patterns in data

B. A set of columns in data that can be used for identifying each record uniquely

C. Non-trivial extraction of possibly useful and previously unknown information in data

### 24. What are the two important features of a good learning algorithm?

A. Complex and transparent

B. Complete and complex

C. Consistent and complete

### 25. What do metadata describe?

A. The structure of the database

B. The contents of the database

C. The structure of the database’s contents

### 26. What is needed for every data mining algorithm?

A. Enough capacity to store a large amount of data

B. An efficient method of sampling data

C. Both A and B

### 27. In the KDD process, which stage follows data selection?

A. Data storing

B. Data cleaning

C. Data exploring

### 28. What does the letter K stand for in the K-nearest neighbour algorithm?

A. Number of total observations in the dataset

B. Number of neighbours that are used

C. Number of iterations

### 29. What does noise mean in data mining?

A. Random errors in the dataset

B. Complex data

C. Repeated data

### 30. In a scatter plot, information on two attributes is displayed in which space?

A. Interactive space

B. Visualization space

C. Cartesian space

Could you answer all of these data mining quiz questions? We hope you have picked up some interesting facts and useful knowledge about this field.