DRAG DROP
What is the best step-by-step order for a machine learning pipeline?
Select and Place:
Which test is applied to determine the relationship between two categorical variables?
A. paired t-test
B. chi-squared test
C. z-test
D. t-test
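For reference, a test of independence between two categorical variables works on a contingency table of counts. The following is a minimal sketch of the chi-squared statistic in plain Python; the function name `chi_squared_statistic` and the example counts are illustrative assumptions, not part of the question.

```python
def chi_squared_statistic(table):
    """Return (statistic, degrees_of_freedom) for a contingency table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the two variables were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Hypothetical counts: rows are one categorical variable, columns the other.
observed = [[30, 10], [20, 40]]
stat, dof = chi_squared_statistic(observed)
print(f"chi2 = {stat:.3f}, dof = {dof}")
```

With 1 degree of freedom, a statistic above roughly 3.84 suggests a relationship at the 5% significance level. In practice a library routine such as SciPy's `chi2_contingency` would also return the p-value.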
What is used to scale large positive values during data cleaning?
A. division by random numbers
B. square
C. logarithm
D. subtract median
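As an illustration of how a logarithm compresses large positive values, here is a minimal sketch using only the standard library. The sample values are made up for the example.

```python
import math

# Hypothetical skewed feature spanning six orders of magnitude.
values = [1, 10, 1_000, 1_000_000]

# Base-10 logarithm: each factor of 10 becomes a step of 1.
log_values = [math.log10(v) for v in values]

for raw, scaled in zip(values, log_values):
    print(f"{raw:>9} -> {scaled:.1f}")
```

The logarithm is only defined for strictly positive inputs. If the data can contain zeros, `math.log1p(v)` (the logarithm of `1 + v`) is a common alternative.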
What is an example of a supervised machine learning algorithm that can be applied to a continuous numeric response variable?
A. linear regression
B. k-means
C. local outlier factor (LOF)
D. naive Bayes
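To make the idea of supervised learning on a continuous response concrete, the following is a minimal sketch of ordinary least squares for a single feature, written in plain Python. The helper `fit_line` and the data points are illustrative assumptions.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (unnormalized).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical labeled training data: the response ys is continuous.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]

slope, intercept = fit_line(xs, ys)
prediction = slope * 5.0 + intercept  # predict the response for an unseen x
print(slope, intercept, prediction)
```

The model is supervised because it learns from known `(x, y)` pairs, and the prediction is a real number rather than a class label or cluster assignment.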
A neural network is trained for a classification task. During training, you monitor the loss on the training dataset and the validation dataset, along with the accuracy on the validation dataset. The goal is to reach an accuracy of 95%.
From the graph, what modification would be appropriate to improve the performance of the model?
A. increase the depth of the neural network
B. insert a dropout layer in the neural network architecture
C. increase the proportion of the training dataset by moving examples from the validation dataset to the training dataset
D. restart the training with a higher learning rate
When communicating technical results to business stakeholders, what are three appropriate topics to include? (Choose three.)
A. methods that failed
B. newest developments in AI methods
C. benefits of cognitive over business analytics
D. realistic impact on the business measures
E. differences between cloud provider portfolios
F. alternative methods to address the business problem
Which situation would disqualify a machine learning system from being used for a particular use case?
A. The use case requires a 100% likelihood of making a correct/true prediction.
B. Training and testing data for the model contain outliers.
C. Data for the machine learning model is available only as static CSV files.
D. The neural network for the model requires significantly more computing power than a logistic regression model.
What are three elements that are typically part of a machine learning pipeline in scikit-learn or pyspark? (Choose three.)
A. model building
B. data preprocessing
C. model prediction
D. business understanding
E. use case selection
F. data exploration
Which is an example of nominal scale data?
A. a variable industry with categorical values such as financial, engineering, and retail
B. a variable mood with a scale of values unhappy, ok, and happy
C. a variable bank account balance whose possible values are $5, $10, and $15
D. a variable temperature with a scale of values low, medium, and high
Which algorithm is best suited if a client needs full explainability of the machine learning model?
A. decision tree
B. logistic regression
C. support vector machine (SVM)
D. recurrent neural network