최신 DAA-C01 무료덤프 - Snowflake SnowPro Advanced: Data Analyst Certification
A Snowflake data analyst is tasked with optimizing the performance of a frequently executed query. The execution plan reveals a 'TableScan' operation on a large table named 'SALES DATR. The 'SALES DATA table is clustered on the 'SALE DATE' column. However, the query predicate uses a range filter on a different column, 'REGION', which is not part of the clustering key. Which of the following strategies would likely improve the query performance significantly? (Select TWO)
정답: A,D
설명: (DumpTOP 회원만 볼 수 있음)
Consider the following Snowflake table schema and data: 'CREATE TABLE products (product_id INTEGER, product_name VARCHAR, properties VARIANT);' Data: 'INSERT INTO products VALUES (1, 'Laptop', "silver", "storage": "512GB", "price": 1200.00}'));' 'INSERT INTO products VALUES (2, 'Mouse', "wireless", "dpi": 1600, "price": 25.00}'));' 'INSERT INTO products VALUES (3, 'Keyboard', PARSE JSON('{"layout": "US", "backlit": true, "price": Which of the following SQL queries will return the 'product_name' and 'price' for all products where the 'price' is greater than 50, ensuring that the 'price' is treated as a numeric value for comparison? Select all that apply
정답: D,E
설명: (DumpTOP 회원만 볼 수 있음)
You've identified a 'Filter' operation in a Snowflake query execution plan that is consuming a significant amount of time. The filter predicate involves a UDF (User-Defined Function) called 'calculate_score(columnl, column2)'. The UDF is written in Python. Analyzing the plan, you observe a high number of rows being processed by this filter. How can you optimize this scenario for faster query execution?
정답: D,E
설명: (DumpTOP 회원만 볼 수 있음)
You are tasked with building a dashboard that visualizes website traffic data stored in Snowflake. The data includes daily unique visitors, bounce rate, and average session duration. The business stakeholders want to understand the correlation between these metrics. They also want to identify any outliers or anomalies. Which chart type is BEST suited for identifying correlation and outliers in this dataset?
정답: C
설명: (DumpTOP 회원만 볼 수 있음)
You have a table named 'SALES DATA' containing daily sales records. You need to identify and handle outliers in the 'SALES AMOUNT' column. Specifically, you want to replace any 'SALES AMOUNT values that fall outside of three standard deviations from the mean with the median 'SALES AMOUNT. What is the most efficient way to achieve this data transformation in Snowflake?
정답: B
설명: (DumpTOP 회원만 볼 수 있음)
You are tasked with creating a data access strategy for a marketing analytics team. They need access to customer purchase data, but only aggregated by region and product category. They should not be able to see individual customer details due to PII compliance. You decide to use a Secure View. Which of the following are the MOST appropriate steps to ensure data security and minimize performance impact?
정답: E
설명: (DumpTOP 회원만 볼 수 있음)
You have a large dataset of IoT sensor readings stored in compressed JSON files within an AWS S3 bucket. Each JSON file contains an array of sensor readings with the following structure:
You need to load this data into a Snowflake table named 'sensor data' with columns 'sensor id', 'timestamp', 'temperature', and 'humidity'. Which of the following Snowflake commands would be the MOST efficient and appropriate to ingest this data, assuming you have already created the table and a named stage pointing to the S3 bucket?

You need to load this data into a Snowflake table named 'sensor data' with columns 'sensor id', 'timestamp', 'temperature', and 'humidity'. Which of the following Snowflake commands would be the MOST efficient and appropriate to ingest this data, assuming you have already created the table and a named stage pointing to the S3 bucket?

정답: E
설명: (DumpTOP 회원만 볼 수 있음)
You are designing a data pipeline in Snowflake that ingests data from multiple external sources with varying schemas and data quality. After ingestion, you need to standardize the data format, handle missing values, and perform data type conversions before loading it into your analytical tables. You need to implement a reusable and maintainable solution. Which approach minimizes code duplication and maximizes data quality?
정답: B
설명: (DumpTOP 회원만 볼 수 있음)
You are analyzing customer churn for a subscription-based service. You have a table 'SUBSCRIPTIONS' with columns: 'CUSTOMER_ID, 'START_DATE', 'END_DATE', 'SUBSCRIPTION TYPE, and 'REVENUE'. You want to classify customers who are likely to churn based on their past subscription behavior. Which Snowflake SQL code snippet is MOST efficient for calculating the number of months each customer was subscribed and identifying those who subscribed for less than 3 months as potential churn candidates?
정답: E
설명: (DumpTOP 회원만 볼 수 있음)
You are using a Snowflake Marketplace data feed that provides daily stock prices. The data is updated daily, and you need to create a process to automatically load the new data into your existing 'STOCK PRICES' table. The Marketplace data feed provides a view called 'MARKETPLACE STOCK PRICES' with columns 'DATE' (DATE), 'SYMBOL' (VARCHAR), and 'PRICE (NUMBER). Your 'STOCK PRICES' table has the same columns. Which of the following Snowflake features or techniques would be BEST suited for automatically loading the new data each day, ensuring that duplicate entries for the same 'DATE and 'SYMBOL' are avoided?
정답: B
설명: (DumpTOP 회원만 볼 수 있음)
A data analyst needs to process a large JSON payload stored in a VARIANT column named 'payload' in a table called 'raw events' The payload contains an array of user sessions, each with potentially different attributes. Each session object in the array has a 'sessionld' , 'userld' , and an array of 'eventS. The events array contains objects with 'eventType' and 'timestamp'. The analyst wants to use a table function to flatten this nested structure into a relational format for easier analysis. Which approach is most efficient and correct for extracting and transforming this data?
정답: E
설명: (DumpTOP 회원만 볼 수 있음)
You have developed a Snowsight dashboard for your marketing team that contains sensitive customer data'. You need to share this dashboard with a specific group of users, but ensure that they can only view the data and cannot modify the dashboard itself or the underlying queries. Which of the following steps should you take to securely share the dashboard?
정답: A
설명: (DumpTOP 회원만 볼 수 있음)