Microsoft 70-475 Actual Free Exam Questions & Community Discussion

  • Exam Code/Number: 70-475
  • Exam Name/Title: Design and Implement Big Data Analytics Solutions
  • Certification Provider: Microsoft
  • Corresponding Certification: Microsoft Azure
  • Exam Questions: 122
  • Updated On: Jun 01, 2026
You are using a Microsoft Azure Data Factory pipeline to copy data to an Azure SQL database.
You need to prevent the insertion of duplicate data for a given dataset slice.
Which two actions should you perform? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
Correct Answer: A,C
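The answer options are not reproduced above, so the following is only a minimal sketch of the general idea behind a repeatable slice load: remove whatever an earlier run wrote for the slice, then insert the fresh rows in the same transaction. It uses an in-memory SQLite database as a stand-in for Azure SQL Database, and the `sales` table, its columns, and the `load_slice` helper are hypothetical names chosen for illustration. It is not Data Factory's own mechanism.

```python
import sqlite3


def load_slice(conn, slice_start, slice_end, rows):
    """Reload one time slice so that a rerun does not duplicate its rows."""
    with conn:  # cleanup and insert commit (or roll back) together
        # Remove anything a previous run wrote for this slice window.
        conn.execute(
            "DELETE FROM sales WHERE event_time >= ? AND event_time < ?",
            (slice_start, slice_end),
        )
        # Insert the current contents of the slice.
        conn.executemany(
            "INSERT INTO sales (event_time, amount) VALUES (?, ?)", rows
        )


# Hypothetical table and sample data, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (event_time TEXT, amount REAL)")
slice_rows = [("2017-01-01T00:10:00", 9.5), ("2017-01-01T00:40:00", 3.0)]

# Run the same slice twice, as a pipeline rerun would.
load_slice(conn, "2017-01-01T00:00:00", "2017-01-01T01:00:00", slice_rows)
load_slice(conn, "2017-01-01T00:00:00", "2017-01-01T01:00:00", slice_rows)

row_count = conn.execute("SELECT COUNT(*) FROM sales").fetchone()[0]
```

Because the delete is scoped to the slice window, rerunning a slice leaves rows from other slices untouched.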
You have four on-premises Microsoft SQL Server data sources as described in the following table.

You plan to create three Azure data factories that will interact with the data sources as described in the following table.

You need to deploy Microsoft Data Management Gateway to support the Azure Data Factory deployment. The solution must use new servers to host the instances of Data Management Gateway.
What is the minimum number of new servers and data management gateways you should deploy? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Correct Answer:

Explanation

Box 1: 3
Box 2: 3
Considerations for using gateway
You have data pushed to Microsoft Azure Blob storage every few minutes.
You want to use an Azure Machine Learning web service to score the data hourly.
You plan to deploy the data factory pipeline by using a Microsoft .NET application.
You need to create an output dataset for the web service.
Which three properties should you define? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
Correct Answer: A,B,E
You are designing a data-driven data flow in Microsoft Azure Data Factory to copy data from Azure Blob storage to Azure SQL Database.
You need to create the copy activity.
How should you complete the JSON code? To answer, drag the appropriate code elements to the correct targets. Each element may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.
Correct Answer:

Explanation
You plan to implement a Microsoft Azure Data Factory pipeline. The pipeline will have custom business logic that requires a custom processing step.
You need to implement the custom processing step by using C#.
Which interface and method should you use? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Correct Answer:

Explanation

References:
https://github.com/MicrosoftDocs/azure-docs/blob/master/articles/data-factory/v1/data-factory-use-custom-activ
You have a Microsoft Azure Data Factory pipeline.
You discover that the pipeline fails to execute because data is missing.
You need to rerun the failure in the pipeline.
Which cmdlet should you use?
Correct Answer: A
Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
Your company has multiple databases that contain millions of sales transactions.
You plan to implement a data mining solution to identify purchasing fraud.
You need to design a solution that mines 10 terabytes (TB) of sales data. The solution must meet the following requirements:
* Run the analysis to identify fraud once per week.
* Continue to receive new sales transactions while the analysis runs.
* Be able to stop computing services when the analysis is NOT running.
Solution: You create a Microsoft Azure HDInsight cluster.
Does this meet the goal?
Correct Answer: B
You plan to deploy Microsoft Azure HDInsight clusters for business analytics and data pipelines. The clusters must meet the following requirements:
* Business users must use a language that is similar to SQL.
* The authoring of data pipelines must occur in a dataflow language.
You need to identify which language must be used for each requirement.
Which languages should you identify? To answer, drag the appropriate languages to the correct requirements.
Each language may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
Correct Answer:

Explanation
You have data in an on-premises Microsoft SQL Server database.
You must ingest the data in Microsoft Azure Blob storage from the on-premises SQL Server database by using Azure Data Factory.
You need to identify which tasks must be performed from Azure.
In which sequence should you perform the actions? To answer, move all of the actions from the list of actions to the answer area and arrange them in the correct order.
NOTE: More than one order of answer choices is correct. You will receive credit for any of the correct orders you select.
Correct Answer:

Explanation

Step 1: Configure a Microsoft Data Management Gateway
Install and configure Azure Data Factory Integration Runtime.
The Integration Runtime is a customer-managed data integration infrastructure used by Azure Data Factory to provide data integration capabilities across different network environments. This runtime was formerly called "Data Management Gateway".
Step 2: Create a linked service for Azure Blob storage
Create an Azure Storage linked service (destination/sink). You link your Azure storage account to the data factory.
Step 3: Create a linked service for SQL Server
Create and encrypt a SQL Server linked service (source). In this step, you link your on-premises SQL Server instance to the data factory.
Step 4: Create an input dataset and an output dataset
Create a dataset for the source SQL Server database. In this step, you create input and output datasets. They represent input and output data for the copy operation, which copies data from the on-premises SQL Server database to Azure Blob storage.
Step 5: Create a pipeline
You create a pipeline with a copy activity. The copy activity uses SqlServerDataset as the input dataset and AzureBlobDataset as the output dataset. The source type is set to SqlSource and the sink type is set to BlobSink.
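As a rough illustration of Step 5, the sketch below assembles a pipeline definition with one copy activity and serializes it to JSON. The dataset names and the `SqlSource` and `BlobSink` types come from the explanation above. The pipeline name, the activity name, the `build_copy_pipeline` helper, and the exact JSON layout are assumptions and should be checked against the referenced tutorial before use.

```python
import json


def build_copy_pipeline(input_dataset, output_dataset):
    """Return a pipeline definition containing a single copy activity."""
    return {
        "name": "SqlServerToBlobPipeline",  # hypothetical pipeline name
        "properties": {
            "activities": [
                {
                    "name": "CopySqlServerToBlob",  # hypothetical activity name
                    "type": "Copy",
                    # Source and sink types as stated in the explanation.
                    "typeProperties": {
                        "source": {"type": "SqlSource"},
                        "sink": {"type": "BlobSink"},
                    },
                    "inputs": [
                        {"referenceName": input_dataset, "type": "DatasetReference"}
                    ],
                    "outputs": [
                        {"referenceName": output_dataset, "type": "DatasetReference"}
                    ],
                }
            ]
        },
    }


pipeline = build_copy_pipeline("SqlServerDataset", "AzureBlobDataset")
pipeline_json = json.dumps(pipeline, indent=2)
print(pipeline_json)
```

Building the definition as a dictionary keeps the dataset names as parameters, so the same helper can produce definitions for other source and sink dataset pairs.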
References: https://docs.microsoft.com/en-us/azure/data-factory/tutorial-hybrid-copy-powershell
Your company supports multiple Microsoft Azure subscriptions.
You plan to deploy several virtual machines to support the services in Azure.
You need to automate the management of all the subscriptions. The solution must minimize administrative effort.
Which two cmdlets should you run? Each correct answer presents part of the solution.
NOTE: Each correct selection is worth one point.
Correct Answer: D,E