Detail kurzu

SAS(R) Data Management Tools and Applications

EDU Trainings s.r.o.

Popis kurzu

This is the second course in the Data Curation Professional, SAS Academy for Data Science program. The program is required to earn your SAS data science certification. Designed for SAS data scientists, this program covers SAS topics for data curation techniques, including big data preparation with Hadoop. In this course, you discover how to access your data from a variety of sources, create processes to manage and transform data, and ensure the reliability and consistency of your data.

Obsah kurzu

SAS/ACCESS Technology OverviewSAS/ACCESS technology overview.SAS Data Integration Studio: EssentialsExploring the SAS platform and SAS Data Integration Studio.Exploring SAS Data Integration Studio basics. Examining SAS Data Integration Studio jobs and options.SAS Data Integration Studio: Defining Source Data MetadataSetting up the environment.Defining metadata for a library.Registering metadata for data sources. Registering SAS table metadata. Registering DBMS table metadata. Registering ODBC data source table metadata. Registering metadata for external files.SAS Data Integration Studio: Defining Target Data MetadataRegistering metadata for target tables.Importing metadata.SAS Data Integration Studio: Working with JobsCreating metadata for jobs.Working with the Join transformation.SAS Data Integration Studio: Working with TransformationsWorking with the Extract and Summary Statistics transformations.Exploring the SQL transformations.Creating custom transformations.Introduction to Data Quality and the SAS Quality Knowledge BaseIntroduction to data quality.SAS Quality Knowledge Base overview.DataFlux Data Management Studio: EssentialsOverview of Data Management Studio. DataFlux Repositories. Quality Knowledge Bases and reference data sources.Data connections.DataFlux Data Management Studio: Understanding DataMethodology review.Creating data collections.Designing data explorations.Creating data profiles.Profiling other input types.Designing data standardization schemes.DataFlux Data Management Studio: Building Data Jobs to Improve DataIntroduction to data jobs.Standardization, parsing, and casing.Identification analysis and right fielding.Branching and gender analysis.Data enrichment.DataFlux Data Management Studio: Building Data Jobs for Entity ResolutionCreating match codes.Clustering records.Survivorship.Understanding the SAS Quality Knowledge Base (QKB)Working with QKB component files.Working with QKB definitions.Using SAS Code to Access QKB ComponentsSAS configuration options for accessing the QKB.SAS Data Quality Server overview.
Certifikát Na dotaz.
Hodnocení




Organizátor