Microsoft Certified Azure Data Scientist Associate, the DP-100 exam measures your ability to accomplish technical tasks like:

Example Questions


You need to resolve the local machine learning pipeline performance issue. What should you do?

A. Increase Graphic Processing Units (GPUs).
B. Increase the learning rate.
C. Increase the training iterations,
D. Increase Central Processing Units (CPUs).


You need to select an environment that will meet the business and data requirements.
Which environment should you use?

A. Azure HDInsight with Spark MLlib
B. Azure Cognitive Services
C. Azure Machine Learning Studio
D. Microsoft Machine Learning Server


You are using C-Support Vector classification to do a multi-class classification with an unbalanced training dataset. The C-Support Vector classification using Python code shown below:

from sklearn.svm import SVC
import numpy as np
svc=SVC(kernel='linear',class_weight='balanced',C=1.0, random_state=1)
model=svc.fit(X_train,y)

Which evaluation statement should you use? To answer, select the appropriate options in the answer area.

class_weight=’balanced’ :

C parameter:


You need to modify the inputs for the global penalty event model to address the bias and variance issue. Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

  1. Built ratios
  2. Bin the new data
  3. Add a K-Means clustering module with 10 clusters
  4. Select the behavior data
  5. Select the location data
  6. Perform a Primary Component Analysis (PCA)

You need to define a process for penalty event detection. Which three actions should you perform in sequence? To answer, move the appropriate actions from
the list of actions to the answer area and arrange them in the correct order.

  1. Standardize to mono audio clips
  2. Vary the lenght of sliding windows between modeling epochs
  3. Vary the lenght of frequency bands between modeling epochs
  4. Use inverse Fourier transform on frequency changes over time
  5. Use Fast Fourier transform on frequency changes over time
  6. Standardize to stereo audio clips

You need to define an evaluation strategy for the crowd sentiment models. Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.

  1. Add new features for retraining supervised models
  2. Filter labeled cases for retraining using the shortest distance from centroids
  3. Evaluate the changes in correlation between model error rate and centroid distance
  4. Impute unavailable features with centroid aligned models
  5. Filter labeled cases for retraining using the longest distance from centroids
  6. Remove features before retraining supervised models

You need to implement a scaling strategy for the local penalty detection data. Which normalization type should you use?


You need to implement a feature engineering strategy for the crowd sentiment local models. What should you do?