Question 16. During mini-batch training of a neural network for a classification problem, a Data Scientist notices that training accuracy oscillates. What is the MOST likely cause of this issue?
(A) The class distribution in the dataset is imbalanced.
(B) Dataset shuffling is disabled.
(C) The batch size is too big.
(D) The learning rate is very high.
Correct Answer: D
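Why D: when the step size is too large, each update overshoots the minimum and lands on the opposite side of it, so the loss (and with it the accuracy) bounces instead of settling. The sketch below is only a toy illustration of that effect, assuming plain gradient descent on the one-dimensional function f(w) = w**2 rather than a real network; the function names and learning rates are made up for the example.

```python
def gradient_descent(lr, w0=1.0, steps=10):
    """Minimize f(w) = w**2 with plain gradient descent; return every visited w."""
    w = w0
    path = [w]
    for _ in range(steps):
        grad = 2.0 * w          # derivative of w**2
        w = w - lr * grad       # standard gradient step
        path.append(w)
    return path


def sign_changes(path):
    """Count how often consecutive iterates land on opposite sides of the minimum at 0."""
    return sum(1 for a, b in zip(path, path[1:]) if a * b < 0)


small = gradient_descent(lr=0.1)    # each step multiplies w by 0.8: smooth approach
large = gradient_descent(lr=0.95)   # each step multiplies w by -0.9: overshoots every time

print("lr=0.10 sign changes:", sign_changes(small))
print("lr=0.95 sign changes:", sign_changes(large))
```

With the small rate the iterate approaches the minimum from one side; with the large rate it jumps across the minimum on every step, which is the one-dimensional analogue of an oscillating training curve.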
Question 17. An employee found a video clip with audio on a company’s social media feed. The language used in the video is Spanish. English is the employee’s first language, and they do not understand Spanish. The employee wants to do a sentiment analysis. What combination of services is the MOST efficient to accomplish the task?
(A) Amazon Transcribe, Amazon Translate, and Amazon Comprehend
(B) Amazon Transcribe, Amazon Comprehend, and Amazon SageMaker seq2seq
(C) Amazon Transcribe, Amazon Translate, and Amazon SageMaker Neural Topic Model (NTM)
(D) Amazon Transcribe, Amazon Translate, and Amazon SageMaker BlazingText
Correct Answer: A
Question 18. A Machine Learning Specialist is packaging a custom ResNet model into a Docker container so the company can leverage Amazon SageMaker for training. The Specialist is using Amazon EC2 P3 instances to train the model and needs to properly configure the Docker container to leverage the NVIDIA GPUs. What does the Specialist need to do?
(A) Bundle the NVIDIA drivers with the Docker image.
(B) Build the Docker container to be NVIDIA-Docker compatible.
(C) Organize the Docker container’s file structure to execute on GPU instances.
(D) Set the GPU flag in the Amazon SageMaker CreateTrainingJob request body.
Correct Answer: B
Question 19. A Machine Learning Specialist is building a logistic regression model that will predict whether or not a person will order a pizza. The Specialist is trying to build the optimal model with an ideal classification threshold. What model evaluation technique should the Specialist use to understand how different classification thresholds will impact the model’s performance?
(A) Receiver operating characteristic (ROC) curve
(B) Misclassification rate
(C) Root Mean Square Error (RMSE)
(D) L1 norm
Correct Answer: A
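Why A: a ROC curve is built by sweeping the classification threshold and recording the false positive rate and true positive rate at each setting, so it shows the trade-off across thresholds directly. The following is a minimal sketch of that sweep in plain Python; the labels, scores, thresholds, and the helper name `roc_point` are invented for illustration and do not come from the question.

```python
def roc_point(labels, scores, threshold):
    """Return (false positive rate, true positive rate) when predicting 1 for score >= threshold."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    positives = sum(labels)
    negatives = len(labels) - positives
    return fp / negatives, tp / positives


# Hypothetical data: 1 = ordered a pizza, 0 = did not; scores are model probabilities.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]

# Each threshold yields one point on the ROC curve.
roc = [roc_point(labels, scores, t) for t in (0.85, 0.5, 0.2)]
for t, (fpr, tpr) in zip((0.85, 0.5, 0.2), roc):
    print(f"threshold={t:.2f}  FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Lowering the threshold catches more true positives at the cost of more false positives, which is exactly the trade-off the curve makes visible. In practice a library routine such as scikit-learn's `roc_curve` would do the sweep over all distinct scores.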
Question 20. An interactive online dictionary wants to add a widget that displays words used in similar contexts. A Machine Learning Specialist is asked to provide word features for the downstream nearest neighbor model powering the widget. What should the Specialist do to meet these requirements?
(A) Create one-hot word encoding vectors.
(B) Produce a set of synonyms for every word using Amazon Mechanical Turk.
(C) Create word embedding vectors that store edit distance with every other word.
(D) Download word embeddings pre-trained on a large corpus.
Correct Answer: D
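Why D: embeddings trained on a large corpus place words that occur in similar contexts close together, so a nearest neighbor lookup over them returns contextually similar words. One-hot vectors, by contrast, are all equally far apart. The sketch below assumes a tiny hand-written embedding table with made-up three-dimensional vectors standing in for real pre-trained embeddings, and uses cosine similarity as the distance measure; `cosine` and `nearest` are hypothetical helper names.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Made-up vectors for illustration only; real pre-trained embeddings
# typically have hundreds of dimensions.
embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.8, 0.2, 0.1],
    "car": [0.1, 0.9, 0.2],
    "truck": [0.0, 0.8, 0.3],
}


def nearest(word, k=1):
    """Return the k words whose vectors are most similar to the given word's vector."""
    query = embeddings[word]
    scored = [(cosine(query, vec), w) for w, vec in embeddings.items() if w != word]
    return [w for _, w in sorted(scored, reverse=True)[:k]]


print(nearest("cat"))
print(nearest("car"))
```

The widget's nearest neighbor model would perform the same kind of lookup, only over a full vocabulary and usually with an approximate index rather than a linear scan.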