DP-201 practice questions
You need to recommend an Azure Cosmos DB solution that meets the following requirements:
✑ All data that was NOT modified during the last 30 days must be purged automatically.
✑ The solution must NOT affect ongoing user requests.
What should you recommend using to purge the data?
A. an Azure Cosmos DB stored procedure executed by an Azure logic app
B. an Azure Cosmos DB REST API Delete Document operation called by an Azure function
C. Time To Live (TTL) setting in Azure Cosmos DB
D. an Azure Cosmos DB change feed queried by an Azure function
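For readers unfamiliar with the Time To Live setting named in option C, the following is a minimal sketch of how TTL expiry is commonly described: an item becomes eligible for removal once a configured number of seconds has passed since its last modification. It is a toy model in plain Python, not the Cosmos DB SDK. The function name `is_expired` and its parameters are hypothetical, and the exact boundary behavior is an assumption.

```python
from typing import Optional

# 30 days expressed in seconds, the unit TTL values are given in.
THIRTY_DAYS = 30 * 24 * 60 * 60  # 2,592,000


def is_expired(
    last_modified_ts: int,
    now_ts: int,
    default_ttl: Optional[int],
    item_ttl: Optional[int] = None,
) -> bool:
    """Toy model of TTL expiry (hypothetical helper, not an SDK call).

    default_ttl: container-level setting; None means TTL is switched off.
    item_ttl:    optional per-item override; -1 means "never expire".
    """
    if default_ttl is None:
        # With no container-level TTL, nothing expires, even if an item sets ttl.
        return False
    ttl = item_ttl if item_ttl is not None else default_ttl
    if ttl == -1:
        return False
    # Assumption: the item expires once ttl seconds have elapsed since the last write.
    return now_ts - last_modified_ts >= ttl
```

In this model, updating an item resets `last_modified_ts`, so only items left untouched for the full period expire.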
Question
You are planning a streaming data solution that will use Azure Databricks. The solution will stream sales transaction data from an online store. The solution has the following specifications:
✑ The output data will contain items purchased, quantity, line total sales amount, and line total tax amount.
✑ Line total sales amount and line total tax amount will be aggregated in Databricks.
✑ Sales transactions will never be updated. Instead, new rows will be added to adjust a sale.
You need to recommend an output mode for the dataset that will be processed by using Structured Streaming. The solution must minimize duplicate data. What should you recommend?
A. Append
B. Complete
C. Update
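As background on the output modes listed above: in Structured Streaming, Complete mode writes the whole result table on every trigger, Update mode writes only the rows that changed since the last trigger, and Append mode writes only newly added rows that will not change afterwards. The sketch below is a toy simulation in plain Python, not Spark code. The function `simulate_output_modes`, the batch shape, and the sample data are all hypothetical. It shows what Complete and Update would emit for a running per-item total.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical batch shape: (item, line total in cents) pairs.
Batch = List[Tuple[str, int]]


def simulate_output_modes(batches: List[Batch]):
    """Return the rows a Complete sink and an Update sink would receive per trigger."""
    totals: Dict[str, int] = defaultdict(int)
    complete_out: List[Dict[str, int]] = []
    update_out: List[Dict[str, int]] = []
    for batch in batches:
        changed = set()
        for item, amount in batch:
            totals[item] += amount
            changed.add(item)
        # Complete: the entire aggregated result table, every trigger.
        complete_out.append(dict(totals))
        # Update: only the aggregate rows touched by this batch.
        update_out.append({item: totals[item] for item in changed})
    return complete_out, update_out


# Made-up data: the second batch adds a new row that adjusts an earlier sale.
batches = [
    [("book", 1000), ("pen", 200)],
    [("book", -300)],
]
complete_out, update_out = simulate_output_modes(batches)
```

After the second batch, the Complete output repeats the unchanged `pen` total, while the Update output contains only the adjusted `book` total.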
Question
A company plans to use Apache Spark Analytics to analyze intrusion detection data. You need to recommend a solution to analyze network and system activities for malicious activities and policy violations. The solution must minimize administrative effort. What should you recommend?
A. Azure Data Factory
B. Azure Data Lake Storage
C. Azure Databricks
D. Azure HDInsight
Question
You design data engineering solutions for a company. A project requires analytics and visualization of a large set of data. The project has the following requirements:
✑ Notebook scheduling
✑ Cluster automation
✑ Power BI Visualization
You need to recommend the appropriate Azure service. Your solution must minimize the number of services required. Which Azure service should you recommend?
A. Azure Batch
B. Azure Stream Analytics
C. Azure Databricks
D. Azure HDInsight
Question
You plan to migrate data to Azure SQL Database. The database must remain synchronized with updates to Microsoft Azure and SQL Server. You need to set up the database as a subscriber. What should you recommend?
A. Azure Data Factory
B. SQL Server Data Tools
C. Data Migration Assistant
D. SQL Server Agent for SQL Server 2017 or later
E. SQL Server Management Studio 17.9.1 or later
Question
You are designing a real-time stream solution based on Azure Functions. The solution will process data uploaded to Azure Blob Storage. The solution requirements are as follows:
✑ Support up to 1 million blobs.
✑ Scaling must occur automatically.
✑ Costs must be minimized.
What should you recommend?
A. Deploy the Azure Function in an App Service plan and use a Blob trigger.
B. Deploy the Azure Function in a Consumption plan and use an Event Grid trigger.
C. Deploy the Azure Function in a Consumption plan and use a Blob trigger.
D. Deploy the Azure Function in an App Service plan and use an Event Grid trigger.
Question
A company purchases IoT devices to monitor manufacturing machinery. The company uses an IoT appliance to communicate with the IoT devices. The company must be able to monitor the devices in real-time. You need to design the solution. What should you recommend?
A. Azure Data Factory instance using Azure Portal
B. Azure Analysis Services using Microsoft Visual Studio
C. Azure Stream Analytics cloud job using Azure Portal
D. Azure Data Factory instance using Azure Portal
Question
You have a C# application that processes data from an Azure IoT hub and performs complex transformations. You need to replace the application with a real-time solution. The solution must reuse as much code as possible from the existing application. What should you recommend?
A. Azure Databricks
B. Azure Event Grid
C. Azure Stream Analytics
D. Azure Data Factory
Question
You are designing a solution that will copy Parquet files stored in an Azure Blob storage account to an Azure Data Lake Storage Gen2 account. The data will be loaded daily to the data lake and will use a folder structure of {Year}/{Month}/{Day}/. You need to design a daily Azure Data Factory data load to minimize the data transfer between the two accounts. Which two configurations should you include in the design? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point.
A. Delete the files in the destination before loading new data.
B. Filter by the last modified date of the source files.
C. Delete the source files after they are copied.
D. Specify a file naming pattern for the destination.
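To make the {Year}/{Month}/{Day}/ folder structure and the last-modified filter from the options concrete, here is a minimal sketch in plain Python. It is not Azure Data Factory configuration. The helper names `destination_folder` and `files_modified_on`, the zero-padded month and day, and the UTC day window are all assumptions made for illustration.

```python
from datetime import date, datetime, timedelta, timezone
from typing import List, Tuple


def destination_folder(load_date: date) -> str:
    """Build the {Year}/{Month}/{Day}/ folder for one daily load (zero padding assumed)."""
    return f"{load_date.year:04d}/{load_date.month:02d}/{load_date.day:02d}/"


def files_modified_on(
    files: List[Tuple[str, datetime]], load_date: date
) -> List[str]:
    """Keep only files whose last-modified time falls within the load date (UTC assumed)."""
    start = datetime(load_date.year, load_date.month, load_date.day, tzinfo=timezone.utc)
    end = start + timedelta(days=1)
    return [name for name, modified in files if start <= modified < end]


# Hypothetical source listing: (file name, last-modified timestamp).
source_files = [
    ("a.parquet", datetime(2021, 3, 4, 23, 59, tzinfo=timezone.utc)),
    ("b.parquet", datetime(2021, 3, 5, 0, 0, tzinfo=timezone.utc)),
    ("c.parquet", datetime(2021, 3, 5, 18, 30, tzinfo=timezone.utc)),
]
to_copy = files_modified_on(source_files, date(2021, 3, 5))
target = destination_folder(date(2021, 3, 5))
```

In this sketch only files changed on the load date are selected, so files copied on earlier days are not transferred again.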
Question
You have a large amount of sensor data stored in an Azure Data Lake Storage Gen2 account. The files are in the Parquet file format. New sensor data will be published to Azure Event Hubs. You need to recommend a solution to add the new sensor data to the existing sensor data in real-time. The solution must support the interactive querying of the entire dataset. Which type of service should you include in the recommendation?
A. Azure SQL Database
B. Azure Cosmos DB
C. Azure Stream Analytics
D. Azure Databricks
Question
You have an Azure Databricks workspace named workspace1 in the Standard pricing tier. Workspace1 contains an all-purpose cluster named cluster1. You need to reduce the time it takes for cluster1 to start and scale up. The solution must minimize costs. What should you do first?
A. Upgrade workspace1 to the Premium pricing tier.
B. Create a pool in workspace1.
C. Configure a global init script for workspace1.
D. Create a cluster policy in workspace1.
Question
You have a Windows-based solution that analyzes scientific data. You are designing a cloud-based solution that performs real-time analysis of the data. You need to design the logical flow for the solution. Which two actions should you recommend? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point.
A. Send data from the application to an Azure Stream Analytics job.
B. Use an Azure Stream Analytics job on an edge device. Ingress data from an Azure Data Factory instance and build queries that output to Power BI.
C. Use an Azure Stream Analytics job in the cloud. Ingress data from the Azure Event Hub instance and build queries that output to Power BI.
D. Use an Azure Stream Analytics job in the cloud. Ingress data from an Azure Event Hub instance and build queries that output to Azure Data Lake Storage.
E. Send data from the application to Azure Data Lake Storage.
F. Send data from the application to an Azure Event Hub instance.
Question
A company installs IoT devices to monitor its fleet of delivery vehicles. Data from devices is collected from Azure Event Hub. The data must be transmitted to Power BI for real-time data visualizations. You need to recommend a solution. What should you recommend?
A. Azure HDInsight with Spark Streaming
B. Apache Spark in Azure Databricks
C. Azure Stream Analytics
D. Azure HDInsight with Storm
Question
You are designing an Azure Data Factory pipeline for processing data. The pipeline will process data that is stored in general-purpose standard Azure storage. You need to ensure that the compute environment is created on-demand and removed when the process is completed. Which type of activity should you recommend?
A. Databricks Python activity
B. Data Lake Analytics U-SQL activity
C. HDInsight Pig activity
D. Databricks Jar activity
Question
You are designing an anomaly detection solution for streaming data from an Azure IoT hub. The solution must meet the following requirements:
✑ Send the output to Azure Synapse.
✑ Identify spikes and dips in time series data.
✑ Minimize development and configuration effort.
Which should you include in the solution?
A. Azure Databricks
B. Azure Stream Analytics
C. Azure SQL Database
Question