AWS Certified Big Data - Specialty Certification - Part 19

Mary Smith

Wed, 15 Apr 2026

1. You are developing an application that will use DynamoDB to store data. The application will handle a large number of read operations. Which of the following design approaches can help minimize the impact of read operations on the target DynamoDB table? Please choose:

A) Deploying a secondary DynamoDB table to absorb the read operations
B) Deploying copies of the DynamoDB table
C) Deploying a secondary RDS database to absorb the reads
D) None
E) Deploying an ElastiCache solution


2. You are currently designing a Redshift table in AWS. The data in this table does not change frequently. In addition, this table does not participate in any joins. Which of the following distribution styles should be preferred for this table? Please choose:

A) not
B) None
C) KEY
D) ALL
E) Also


3. Your EMR cluster uses twelve m4.large instances and runs 24 hours a day, but it is only used for processing and reporting during business hours. Which options can you use to cut costs? Choose two of the options listed below. (Select 2 answers)

A) Using auto scaling to scale the cluster in and out as necessary
B) Moving the data from HDFS to S3 with S3DistCp and shutting the cluster down when it is not in use
C) Using Spot Instances for data nodes when needed
D) Using Reserved Instances for the task nodes



4. You currently have a test environment that consists of Redshift clusters. Now that the test phase is complete, the cluster will be used in production and is expected to run for a long period of time. Which of the following measures can be taken to reduce the cost of the Redshift cluster?

A) Using Spot Instances, because they are cheaper than On-Demand Instances
B) Using reserved instances for better overall discounts
C) None
D) Using on-demand nodes, so that they can be shut down when necessary
E) Using consolidated billing to reduce costs


5. Which of the following can be used to obtain a higher-quality model using Amazon Machine Learning? Please choose: (Select 2 answers)

A) Model data tags
B) Performing evaluations
C) Using constant data sources
D) Using regularization



1. Right Answer: E
Explanation: The solution is to cache reads at the application level. Caching is a technique used in many high-throughput applications: read activity on hot items is offloaded to the cache rather than to the database. Your application can cache the most popular items in memory, or use a product such as ElastiCache to do the same.
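
To make the caching idea concrete, here is a minimal in-memory sketch in Python. It is only an illustration: the `ReadThroughCache` class, the `FAKE_TABLE` dictionary, and the `fetch` callable are hypothetical stand-ins for a real DynamoDB read, not part of any AWS SDK.

```python
from typing import Callable, Dict, Optional


class ReadThroughCache:
    """In-memory cache placed in front of a slower table lookup."""

    def __init__(self, fetch: Callable[[str], Optional[dict]]) -> None:
        self._fetch = fetch            # hypothetical stand-in for a DynamoDB read
        self._items: Dict[str, dict] = {}
        self.table_reads = 0           # counts reads that actually reach the table

    def get(self, key: str) -> Optional[dict]:
        if key in self._items:         # hot item: served from memory
            return self._items[key]
        self.table_reads += 1          # cache miss: fall through to the table
        item = self._fetch(key)
        if item is not None:
            self._items[key] = item    # remember it for later reads
        return item


# Hypothetical table contents, used only for this illustration.
FAKE_TABLE = {"user#1": {"name": "Ana"}, "user#2": {"name": "Raj"}}

cache = ReadThroughCache(FAKE_TABLE.get)
for _ in range(1000):
    cache.get("user#1")                # only the first call reaches the table
```

After the loop, `cache.table_reads` is 1 even though the item was requested 1000 times. A real deployment would also need expiry and invalidation so that cached items do not go stale; the sketch leaves that out.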

2. Right Answer: D
Explanation: With ALL distribution, a copy of the entire table is distributed to every node. Whereas EVEN distribution or KEY distribution places only a portion of the table's rows on each node, ALL distribution ensures that every row is collocated for every join that the table participates in.
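
The difference between the three styles can be sketched with a toy model in Python. This is a simplified illustration, not how Redshift places rows internally: the `distribute` function, the sample rows, and the use of Python's built-in `hash` are all assumptions made for the example.

```python
from typing import Dict, List, Optional

Row = Dict[str, int]


def distribute(rows: List[Row], num_nodes: int, style: str,
               key: Optional[str] = None) -> List[List[Row]]:
    """Toy model of how rows land on nodes under each distribution style."""
    nodes: List[List[Row]] = [[] for _ in range(num_nodes)]
    for index, row in enumerate(rows):
        if style == "ALL":
            for node in nodes:                 # every node gets a full copy
                node.append(row)
        elif style == "EVEN":
            nodes[index % num_nodes].append(row)   # round-robin placement
        elif style == "KEY":
            if key is None:
                raise ValueError("KEY distribution needs a key column")
            # Rows with the same key value always land on the same node.
            nodes[hash(row[key]) % num_nodes].append(row)
        else:
            raise ValueError(f"unknown style: {style}")
    return nodes


# Hypothetical sample data: six orders belonging to two customers.
rows = [{"order_id": i, "customer_id": i % 2} for i in range(6)]
row_counts = {
    style: [len(node) for node in distribute(rows, 3, style, key="customer_id")]
    for style in ("ALL", "EVEN", "KEY")
}
```

In this toy run, ALL puts all six rows on each of the three nodes, EVEN puts two rows on each, and KEY groups the rows by `customer_id`, leaving one node empty. The storage cost of ALL grows with the number of nodes, which is why it suits tables that are updated rarely.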

3. Right Answer: B,C
Explanation: Spot Instances can be used to reduce the cost of the cluster's nodes. The following is a case study on optimizing costs in EMR: Strategies to reduce the costs of your Amazon EMR

4. Right Answer: B
Explanation: If you intend to keep your Amazon Redshift cluster running continuously for a prolonged period, you should consider purchasing reserved node offerings. These offerings provide significant savings over on-demand pricing, but they require you to reserve compute nodes and commit to paying for those nodes for either a one-year or a three-year duration.
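
The trade-off can be worked through with simple arithmetic. The Python sketch below uses made-up prices (1.00 per node-hour on demand, 6000 per node per year reserved) purely for illustration; they are not actual Redshift prices, and the function names are hypothetical.

```python
HOURS_PER_YEAR = 24 * 365  # 8760 hours


def yearly_on_demand_cost(hourly_rate: float, nodes: int,
                          hours_used: int = HOURS_PER_YEAR) -> float:
    """Cost of paying the on-demand rate for every hour the nodes run."""
    return hourly_rate * nodes * hours_used


def yearly_reserved_cost(reserved_price_per_node: float, nodes: int) -> float:
    """Cost of a one-year commitment, paid whether or not the nodes are used."""
    return reserved_price_per_node * nodes


def break_even_hours(hourly_rate: float, reserved_price_per_node: float) -> float:
    """Hours per year above which the reservation is the cheaper choice."""
    return reserved_price_per_node / hourly_rate


# Hypothetical figures, chosen only to make the arithmetic easy to follow.
on_demand = yearly_on_demand_cost(hourly_rate=1.00, nodes=4)
reserved = yearly_reserved_cost(reserved_price_per_node=6000.0, nodes=4)
savings = on_demand - reserved
threshold = break_even_hours(1.00, 6000.0)
```

With these assumed numbers, a four-node cluster running all year costs 35040 on demand and 24000 reserved, a saving of 11040. The reservation only pays off if each node runs more than 6000 hours a year, which is why it fits a cluster that runs continuously.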

5. Right Answer: B,D
Explanation: Regularization is a machine learning technique that you can use to obtain higher-quality models. Evaluations measure the quality of the ML model and determine whether it is performing well.
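
As a rough illustration of what regularization does, the Python sketch below fits a one-feature linear model with no intercept under an L2 penalty. It is a generic textbook example, not Amazon Machine Learning's implementation; the function name and the sample data are assumptions.

```python
from typing import Sequence


def ridge_weight(xs: Sequence[float], ys: Sequence[float], l2: float) -> float:
    """Weight w minimizing sum((w*x - y)^2) + l2 * w^2 for a one-feature model.

    Setting the derivative to zero gives w = sum(x*y) / (sum(x^2) + l2).
    """
    sum_xy = sum(x * y for x, y in zip(xs, ys))
    sum_xx = sum(x * x for x in xs)
    return sum_xy / (sum_xx + l2)


# Hypothetical training data lying exactly on the line y = 2x.
xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]

unregularized = ridge_weight(xs, ys, l2=0.0)   # fits the data exactly
regularized = ridge_weight(xs, ys, l2=14.0)    # penalty shrinks the weight
```

With no penalty the fitted weight is 2.0; with `l2=14.0` it shrinks to 1.0. Pulling weights toward zero in this way is how regularization discourages a model from fitting noise in the training data, and an evaluation on held-out data is what tells you whether a given penalty strength actually helped.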