Free DP-203 Exam Braindumps (page: 42)

Page 42 of 94

HOTSPOT (Drag and Drop is not supported)
You have an Azure Data Factory instance named ADF1 and two Azure Synapse Analytics workspaces named WS1 and WS2.

ADF1 contains the following pipelines:
-P1: Uses a copy activity to copy data from a nonpartitioned table in a dedicated SQL pool of WS1 to an Azure Data Lake Storage Gen2 account
-P2: Uses a copy activity to copy data from text-delimited files in an Azure Data Lake Storage Gen2 account to a nonpartitioned table in a dedicated SQL pool of WS2

You need to configure P1 and P2 to maximize parallelism and performance.
Which dataset settings should you configure for the copy activity if each pipeline? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

  1. See Explanation section for answer.

Answer(s): A

Explanation:


Box 1: Set the Copy method to PolyBase
While SQL pool supports many loading methods including non-Polybase options such as BCP and SQL BulkCopy API, the fastest and most scalable way to load data is through PolyBase. PolyBase is a technology that accesses external data stored in Azure Blob storage or Azure Data Lake Store via the T-SQL language.
Box 2: Set the Copy method to Bulk insert
Polybase not possible for text files. Have to use Bulk insert.


Reference:

https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/load-data-overview



HOTSPOT (Drag and Drop is not supported)
You have an Azure Storage account that generates 200,000 new files daily. The file names have a format of {YYYY}/{MM}/{DD}/{HH}/{CustomerID}.csv.
You need to design an Azure Data Factory solution that will load new data from the storage account to an Azure Data Lake once hourly. The solution must minimize load times and costs.

How should you configure the solution? To answer, select the appropriate options in the answer area. NOTE: Each correct selection is worth one point.
Hot Area:

  1. See Explanation section for answer.

Answer(s): A

Explanation:



Box 1: Incremental load
Box 2: Tumbling window
Tumbling windows are a series of fixed-sized, non-overlapping and contiguous time intervals. The following diagram illustrates a stream with a series of events and how they are mapped into 10-second tumbling windows.


Reference:

https://docs.microsoft.com/en-us/stream-analytics-query/tumbling-window-azure-stream-analytics



Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You plan to create an Azure Databricks workspace that has a tiered structure. The workspace will contain the following three workloads:
-A workload for data engineers who will use Python and SQL.
-A workload for jobs that will run notebooks that use Python, Scala, and SQL.
-A workload that data scientists will use to perform ad hoc analysis in Scala and R.

The enterprise architecture team at your company identifies the following standards for Databricks environments:
-The data engineers must share a cluster.
-The job cluster will be managed by using a request process whereby data scientists and data engineers provide packaged notebooks for deployment to the cluster.
-All the data scientists must be assigned their own cluster that terminates automatically after 120 minutes of inactivity. Currently, there are three data scientists.
-You need to create the Databricks clusters for the workloads.

Solution: You create a Standard cluster for each data scientist, a Standard cluster for the data engineers, and a High Concurrency cluster for the jobs.
Does this meet the goal?

  1. Yes
  2. No

Answer(s): B

Explanation:

We need a High Concurrency cluster for the data engineers and the jobs.
Note: Standard clusters are recommended for a single user. Standard can run workloads developed in any language: Python, R, Scala, and SQL.
A high concurrency cluster is a managed cloud resource. The key benefits of high concurrency clusters are that they provide Apache Spark-native fine-grained sharing for maximum resource utilization and minimum query latencies.


Reference:

https://docs.azuredatabricks.net/clusters/configure.html



You have the following Azure Data Factory pipelines:
-Ingest Data from System1
-Ingest Data from System2
-Populate Dimensions
-Populate Facts
-Ingest Data from System1 and Ingest Data from System2 have no dependencies.
-Populate Dimensions must execute after Ingest Data from System1 and Ingest Data from System2.
-Populate Facts must execute after Populate Dimensions pipeline.
-All the pipelines must execute every eight hours.

What should you do to schedule the pipelines for execution?

  1. Add an event trigger to all four pipelines.
  2. Add a schedule trigger to all four pipelines.
  3. Create a patient pipeline that contains the four pipelines and use a schedule trigger.
  4. Create a patient pipeline that contains the four pipelines and use an event trigger.

Answer(s): C

Explanation:

Schedule trigger: A trigger that invokes a pipeline on a wall-clock schedule.


Reference:

https://docs.microsoft.com/en-us/azure/data-factory/concepts-pipeline-execution-triggers



Page 42 of 94



Post your Comments and Discuss Microsoft DP-203 exam with other Community members:

Ashwani commented on December 20, 2024
Nice questions
UNITED KINGDOM
upvote

Chaminda commented on November 28, 2024
great papers
Anonymous
upvote

Michal commented on October 11, 2024
I hope it will worth it
POLAND
upvote

John commented on August 30, 2024
This exam dump helped me pass my DP-203 exam.
Anonymous
upvote

Rameez commented on July 08, 2024
This is a great resource
UNITED STATES
upvote

Robinson commented on June 28, 2024
Great work and challenge to oneself before sitting for the exam
Anonymous
upvote

Robinson commented on June 27, 2024
Honestly, this is a great resource.
Anonymous
upvote

Mike Liu commented on June 24, 2024
Very useful materials
SINGAPORE
upvote

Rod commented on June 13, 2024
Very professional content and professional team. The support team is knowledgeable polite and very quick to reply and help. I am happy with my purchase.
Australia
upvote

Gaston commented on June 13, 2024
After going over this free version of the exam I decided to buy the full PDF version and the free software that comes with it. I ma very glad I did it. Now it is much easier to study. I will post about my exam result once I write it next week. Wish me luck guys.
European Union
upvote

Wilma commented on June 13, 2024
Passed my AI-102 exam with this exam dumps. The exam is very hard at least for my knowledge. I am pretty new in the industry and I want to add as many certificates as I can to my CV.
UNITED STATES
upvote

Mel commented on June 13, 2024
Well written
Anonymous
upvote

Mel commented on June 13, 2024
Perfect queries
Anonymous
upvote

Tolaram commented on June 13, 2024
I bought 2 exams with 50% discount. I already passed this exam. I hope I can pass the second one as well. The questions in the first exam was word by word from this exam dump.
INDIA
upvote

Satheesh commented on June 12, 2024
Hi Guys, Are these dumps will help now also? Are these questions still comes in the exam. Please let me know.
INDIA
upvote

Arun commented on May 30, 2024
@Neetha, Pl let me know your comments whether is questions still in exam
SINGAPORE
upvote

Neetha commented on May 25, 2024
These dumps can help right now also.did anybody try recently.pls let me know. I am going to right dp-203 in next week.
CANADA
upvote

vamsi commented on May 07, 2024
this is helping a lot
Anonymous
upvote

Jain commented on April 26, 2024
I have used 3 Microsoft study packages from this site and passed all 3 of my exams. The contract followes all the topics and scenarios of the exam.
INDIA
upvote

Saira commented on March 14, 2023
I was skeptical at first, but this exam dump helped me pass my test!
UNITED KINGDOM
upvote

Tory commented on January 11, 2023
Welcome to the wold of easy passing. LOL Gotta love these brain dumps!
CANADA
upvote

Masomba commented on June 11, 2022
I foudn about 85% to 90% of the questions in the exam. This is a valid dumps guys.
SOUTH AFRICA
upvote

Shawn commented on March 18, 2022
Just passed with 91% mark today.
UNITED STATES
upvote

Muhammed commented on March 18, 2022
The support team is very helpful. They managed to fix the issue I had with my Xengine App software becuase I am running Arabic OS.
UNITED ARAB EMIRATES
upvote

Urmila commented on March 18, 2022
I really apprecaite the 50% discount. I bouth 3 exams for half price. I already passed 1 exam. The other 2 are underway.
UNITED STATES
upvote

Lisa commented on October 07, 2021
This makes the exam like a piece of cake. Very accurate. I recommend.
UNITED STATES
upvote

Armd-Educator commented on August 30, 2021
I am officially certified now. Thanks to Braindumps-pdf website. Their questions and the Xengine Software is the best.
SOUTH AFRICA
upvote