DSA-C02 Online Practice Questions and Answers

Questions 4

What is the formula for measuring skewness in a dataset?

A. MEAN - MEDIAN

B. MODE - MEDIAN

C. (3(MEAN - MEDIAN))/ STANDARD DEVIATION

D. (MEAN - MODE)/ STANDARD DEVIATION

Buy Now

Questions 5

Which is the visual depiction of data through the use of graphs, plots, and informational graphics?

A. Data Interpretation

B. Data Virtualization

C. Data visualization

D. Data Mining

Buy Now

Questions 6

Which ones are the known limitations of using External function? Choose all apply.

A. Currently, external functions cannot be shared with data consumers via Secure Data Sharing.

B. Currently, external functions must be scalar functions. A scalar external function re-turns a single value for each input row.

C. External functions have more overhead than internal functions (both built-in functions and internal UDFs) and usually execute more slowly

D. An external function accessed through an AWS API Gateway private endpoint can be accessed only from a Snowflake VPC (Virtual Private Cloud) on AWS and in the same AWS region.

Buy Now

Questions 7

A Data Scientist as data providers require to allow consumers to access all databases and database objects in a share by granting a single privilege on shared databases. Which one is incorrect SnowSQL command used by her while doing this task?

Assuming:

A database named product_db exists with a schema named product_agg and a table named Item_agg.

The database, schema, and table will be shared with two accounts named xy12345 and yz23456.

1.USE ROLE accountadmin;

2.CREATE DIRECT SHARE product_s;

3.GRANT USAGE ON DATABASE product_db TO SHARE product_s;

4.GRANT USAGE ON SCHEMA product_db. product_agg TO SHARE product_s;

5.GRANT SELECT ON TABLE sales_db. product_agg.Item_agg TO SHARE product_s; 6.SHOW GRANTS TO SHARE product_s;

7.ALTER SHARE product_s ADD ACCOUNTS=xy12345, yz23456;

8.SHOW GRANTS OF SHARE product_s;

A. GRANT USAGE ON DATABASE product_db TO SHARE product_s;

B. CREATE DIRECT SHARE product_s;

C. GRANT SELECT ON TABLE sales_db. product_agg.Item_agg TO SHARE product_s;

D. ALTER SHARE product_s ADD ACCOUNTS=xy12345, yz23456;

Buy Now

Questions 8

Which type of Python UDFs let you define Python functions that receive batches of input rows as Pandas DataFrames and return batches of results as Pandas arrays or Series?

A. MPP Python UDFs

B. Scaler Python UDFs

C. Vectorized Python UDFs

D. Hybrid Python UDFs

Buy Now

Questions 9

Consider a data frame df with 10 rows and index [ 'r1', 'r2', 'r3', 'row4', 'row5', 'row6', 'r7', 'r8', 'r9', 'row10']. What does the aggregate method shown in below code do?

g = df.groupby(df.index.str.len())

A. aggregate({'A':len, 'B':np.sum})

B. Computes Sum of column A values

C. Computes length of column A

D. Computes length of column A and Sum of Column B values of each group

E. Computes length of column A and Sum of Column B values

Buy Now

Questions 10

Consider a data frame df with 10 rows and index [ 'r1', 'r2', 'r3', 'row4', 'row5', 'row6', 'r7', 'r8', 'r9', 'row10']. What does the expression g = df.groupby(df.index.str.len()) do?

A. Groups df based on index values

B. Groups df based on length of each index value

C. Groups df based on index strings

D. Data frames cannot be grouped by index values. Hence it results in Error.

Buy Now

Questions 11

Select the correct mappings:

I. W Weights or Coefficients of independent variables in the Linear regression model --> Model Pa-rameter

II. K in the K-Nearest Neighbour algorithm --> Model Hyperparameter

III. Learning rate for training a neural network --> Model Hyperparameter

IV.

Batch Size --> Model Parameter

I,II

I,II,III

III,IV

II,III,IV

Buy Now

Correct Answer: B

Explanation:

Hyperparameters in Machine learning are those parameters that are explicitly defined by the user to control the learning process. These hyperparameters are used to improve the learning of the model, and their values are set before starting

the learning process of the model.

What are hyperparameters?

In Machine Learning/Deep Learning, a model is represented by its parameters. In contrast, a training process involves selecting the best/optimal hyperparameters that are used by learning algorithms to provide the best result. So, what are

these hyperparameters? The answer is, "Hyperparameters are defined as the parameters that are explicitly defined by the user to control the learning process."

Here the prefix "hyper" suggests that the parameters are top-level parameters that are used in con-trolling the learning process. The value of the Hyperparameter is selected and set by the machine learning engineer before the learning

algorithm begins training the model. Hence, these are external to the model, and their values cannot be changed during the training process.

Some examples of Hyperparameters in Machine Learning ?The k in kNN or K-Nearest Neighbour algorithm

Learning rate for training a neural network

Train-test split ratio

Batch Size

Number of Epochs

Branches in Decision Tree

Number of clusters in Clustering Algorithm

Model Parameters:

Model parameters are configuration variables that are internal to the model, and a model learns them on its own. For example, W Weights or Coefficients of independentvariables in the Linear regression model. or Weights or Coefficients of

independent variables in SVM, weight, and biases of a neural network, cluster centroid in clustering. Some key points for model parameters are as follows:

They are used by the model for making predictions.

They are learned by the model from the data itself.

These are usually not set manually.

These are the part of the model and key to a machine learning Algorithm.

Model Hyperparameters:

Hyperparameters are those parameters that are explicitly defined by the user to control the learning process. Some key points for model parameters are as follows:

These are usually defined manually by the machine learning engineer. One cannot know the exact best value for hyperparameters for the given problem. The best value can be determined either by the rule of thumb or by trial and error. Some

examples of Hyperparameters are the learning rate for training a neural network, K in the KNN algorithm.

Questions 12

Mark the incorrect statement regarding usage of Snowflake Stream and Tasks?

A. Snowflake automatically resizes and scales the compute resources for serverless tasks.

B. Snowflake ensures only one instance of a task with a schedule (i.e. a standalone task or the root task in a DAG) is executed at a given time. If a task is still running when the next scheduled execution time occurs, then that scheduled time is skipped.

C. Streams support repeatable read isolation.

D. An standard-only stream tracks row inserts only.

Buy Now

Questions 13

Which of the following cross validation versions is suitable quicker cross-validation for very large datasets with hundreds of thousands of samples?

A. k-fold cross-validation

B. Leave-one-out cross-validation

C. Holdout method

D. All of the above

Buy Now

Exam Code: DSA-C02

Exam Name: SnowPro Advanced: Data Scientist Certification (DSA-C02)