Data seem to be everywhere. With the spread of social networks, smartphones, and other technical systems, information has become the new gold for businesses. That’s why many new techniques and procedures have been created to search, collect, clean, and analyse data. Whether you are a layman or a junior data scientist, check out these 35 data mining quiz questions and answers to test your knowledge.
Data Mining Quiz Questions and Answers
1. What is the main goal of data mining?
A. To store and distribute data
B. To turn raw data into helpful information
C. To classify data for further tasks
2. Width and height belong to which form of data?
A. Continuous data
B. Discrete data
C. Finite data
3. Which process finds hidden patterns in unlabeled data?
A. Reinforcement learning
B. Unsupervised learning
C. Supervised learning
4. In addition to machine learning and artificial intelligence, which branch of mathematics is the foundation of data mining?
A. Linear algebra
B. Geometry
C. Statistics
5. Which type of modelling are decision trees?
A. Descriptive modelling
B. Predictive modelling
C. Prescriptive modelling
6. Which of the following is not an application of data mining?
A. Financial fraud detection
B. Spam e-mail filtering
C. Cost minimisation
7. Which process takes operational data from different sources and maps it to a new structure in the warehouse?
A. Transformation
B. Integration
C. Combination
8. In data mining, which stage involves collecting and preparing data?
A. Exploration
B. Validation
C. Both A and B
9. In a decision tree, each node is either a decision node or …
A. A sub node
B. A root node
C. A leaf node
10. What is the application of the Naïve Bayes Algorithm?
A. To estimate the probability of a class value in prediction and classification
B. To generate a mining model
C. Both A and B
11. Which of the following algorithms can be used for finding correlations among attributes in data?
A. Series algorithm
B. Association algorithm
C. Associative algorithm
12. A supermarket wants to divide its customers into different groups with distinctive features. Which process should be used?
A. Unsupervised learning
B. Supervised learning
C. Reinforcement learning
13. Which technique can be used to predict the number of newborn babies based on existing data?
A. Clustering
B. Classification
C. Regression
14. Which of the following is a basic process of data mining?
A. Infrastructure, exploration, analysis, interpretation, and exploitation
B. Infrastructure, analysis, exploration, exploitation, and interpretation
C. Infrastructure, exploration, analysis, exploitation, and interpretation
15. What is a binary attribute?
A. A system that can be used without previous knowledge of the internal operation
B. Variables with only two values
C. A natural environment of some species
16. A self-organizing map is a typical example of which technique?
A. Supervised learning
B. Unsupervised learning
C. Reinforcement learning
17. What does ETL stand for?
A. Exclude, transport, load
B. Extract, transform, load
C. Extract, transport, lift
18. What do we call the process of dividing data into three separate parts: a training set, a validation set, and a testing set?
A. Data exploring
B. Data modelling
C. Data partitioning
19. When was the term “data mining” first used?
A. In the 1980s
B. In the 1990s
C. In the 2000s
20. Which of the following is often considered one of the earliest methods for identifying patterns in data?
A. Bayes’s theorem
B. Regression analysis
C. Cluster analysis
21. How many phases are there in the Cross-Industry Standard Process for Data Mining (CRISP-DM)?
A. 3
B. 4
C. 6
22. Which technique is mostly used to discover structures or groups with similar features in data mining?
A. Regression
B. Classification
C. Clustering
23. Knowledge Discovery in Databases (KDD) refers to …
A. A collection of useful and interesting patterns in data
B. A set of columns in data that can be used for identifying each record uniquely
C. Non-trivial extraction of possibly useful and previously unknown information in data
24. What are the two important features of a good learning algorithm?
A. Complex and transparent
B. Complete and complex
C. Consistent and complete
25. What do metadata describe?
A. Structure of the database
B. Contents of the database
C. Structure of the database’s contents
26. What is needed for every data mining algorithm?
A. Enough capacity to store a large amount of data
B. An efficient method of sampling data
C. Both A and B
27. In the KDD process, which stage follows data selection?
A. Data storing
B. Data cleaning
C. Data exploring
28. What does the letter K stand for in the K-nearest neighbour algorithm?
A. Number of total observations in the dataset
B. Number of neighbours that are used
C. Number of iterations
29. What does noise mean in data mining?
A. Random errors in the dataset
B. Complex data
C. Repeated data
30. In a scatter plot, the information on 2 attributes is displayed in which space?
A. Interactive space
B. Visualisation space
C. Cartesian space
31. Which process is used to rescale the values of data into a range from 0 to 1?
A. Data normalisation
B. Data preprocessing
C. Data churning
32. Which data mining technique uses nodes to process data?
A. K-nearest neighbour
B. Neural network
C. Decision tree
33. pandas, numpy, and scikit-learn are libraries for data mining in which programming language?
A. R
B. JavaScript
C. Python
34. Data mining can be divided into which two main types?
A. Descriptive and predictive
B. Explorative and assumptive
C. Traditional and modern
35. In the 2010s, a British company called Cambridge Analytica collected data from which social media platform for political advertising?
A. Twitter
B. Facebook
C. YouTube
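If you would like to see two of the ideas from the quiz in action, here is a minimal Python sketch of rescaling values into the range 0 to 1 (question 31) and dividing a dataset into training, validation, and testing sets (question 18). It uses only the standard library. The function names, the sample heights, and the 60/20/20 split are assumptions made up for illustration, not a standard recipe.

```python
import random


def min_max_scale(values):
    """Rescale numbers linearly so the smallest maps to 0.0 and the largest to 1.0."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def three_way_split(rows, train=0.6, validation=0.2, seed=0):
    """Shuffle the rows, then cut them into training, validation, and testing sets."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split repeatable
    n_train = round(len(shuffled) * train)
    n_val = round(len(shuffled) * validation)
    return (
        shuffled[:n_train],
        shuffled[n_train:n_train + n_val],
        shuffled[n_train + n_val:],  # whatever is left becomes the testing set
    )


# Hypothetical sample data: ten heights in centimetres.
heights_cm = [150, 160, 170, 180, 190, 200, 155, 165, 175, 185]

scaled = min_max_scale(heights_cm)
train_set, validation_set, test_set = three_way_split(scaled)

print("Scaled range:", min(scaled), "to", max(scaled))
print("Set sizes:", len(train_set), len(validation_set), len(test_set))
```

In practice, libraries such as scikit-learn provide ready-made tools for both steps, but the plain version above shows what happens under the hood.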
Can you answer all of these data mining quiz questions? We hope you have learned many interesting facts and picked up some helpful knowledge about this new field. Check out the following math quiz, Bitcoin quiz, or this trivia measurement quiz for more.