Top 100 interview questions on Data Science & Machine Learning
Upasana | September 06, 2019 | 3 min read | 1,101 views
Questions could comprise of coding and theory questions both based on tech skills as : Language: R, Python Skills: Machine Learning, Statistics
Question bank on data science concepts
- 
Why did you do masters in mathematics? 
- 
Rate yourself in statistics. 
- 
Rate yourself in Machine Learning. 
- 
You don’t know java. Why should we hire you? 
- 
Rate yourself in R. 
- 
How will you extract data based on only two categories from column A and one category from Column B of a data frame in R? 
- 
Do you know how to pseudo code in python? 
- 
Are you aware of SQL DB? 
- 
Can you give examples for uniformly distributed dataset? 
- 
Why didn’t you normalise dataset? (While i was explaining a project) 
- 
Explain chi-square distribution? 
- 
When will you use chi-square test? 
- 
What is the difference between decision trees and random forest? 
- 
How will you tune random forest model? 
- 
What is confusion matrix? 
- 
How will you evaluate a classification model? 
- 
What is variance and bias? 
- 
What is recall score? 
- 
What is precision score? 
- 
What is F-score? 
- 
We don’t know how data science works. We can just get you a project and get connected with client. You will have to solve the problem yourself. can you do that? 
- 
You haven’t worked in any production level project. Why? 
- 
You have mostly worked with AV’s and Kaggle’s datasets. Why not some real dataset? 
- 
How does correlation plays role in modelling? 
- 
What would you do if you have got imbalanced dataset to work upon? 
- 
Write SQL Query to get top 5 students w.r.t marks in mathematics if you have a table which contains data of mark sheet of students of a class. 
- 
Which algorithm was used in restaurant reviews classification project? 
- 
Explain Naive Bayes and its assumptions. 
Machine Learning question bank for experienced
- 
Explain this project and your role? 
- 
What problems you faced while working on A project? 
- 
How many frames were passed per second in openCV based project? 
- 
Why did you use regulariser? 
- 
Have you worked on pyspark, if yes then which one?(RDD or Dataframe) 
- 
Explain ReLu. 
- 
When will you use ReLu? 
- 
Difference between softmax and sigmoid function? 
- 
How does CNN works? 
- 
What is max pooling and how does it works? 
- 
What is recall score? 
- 
What is precision score? 
- 
What is F-score? 
- 
What was the data source used in A project? 
- 
Why have you mostly worked with keras and why not tensorflow? 
- 
Do you know data structures? 
- 
Can you write a custom function for deep learning model? 
- 
what is the role of loss function and optimiser in a deep learning model? 
- 
How will you use SVM for multi categorical classification(more than 2 categories? 
- 
How does strides work in CNN? 
- 
What are the libraries that you worked with in Python? 
- 
What are the libraries that you worked with in Python related to NLP? 
- 
Where have you used pandas in your projects? 
- 
What is tensor? 
- 
Define features you would need so as to tell if a person is diabetic or not? 
- 
Tell a project in which you had to create data from scratch and the problems you faced while working on it. 
- 
What base model you would suggest for Google’s smart reply? 
Top articles in this category:
- Google Data Scientist interview questions with answers
- Machine Learning based Multiple choice questions
- Flask Interview Questions
- Python coding challenges for interviews
- Installing PySpark with Jupyter notebook on Ubuntu 18.04 LTS
- Difference between Loss, Accuracy, Validation loss, Validation accuracy in Keras
- AWS Lambda Interview Questions for Developers
 
                         
                        