Data Fusion (Cont.)


AWS (Amazon Web Services) Glue
AWS Glue is a fully managed ETL (extract, transform, and load) service that makes it simple to categorize your data, clean it, enrich it, and move it reliably between various data stores and data streams. AWS Glue is serverless, so there’s no infrastructure to set up or manage.

AWS Glue consists of a central metadata repository known as the AWS Glue Data Catalog, an ETL engine that automatically generates Python or Scala code, and a flexible scheduler that handles dependency resolution, job monitoring, and retries.

Google Cloud Data Fusion vs AWS Glue
Votes between AWS Glue and Google Cloud Data Fusion are given on the right. They are for reference only because it is subjective.

Review: AWS (Amazon Web Services) Glue
    Which component is NOT included in the AWS (Amazon Web Services) Glue?

      Data Catalog
      ETL Engine
      Scheduler
      Wrangler
Result:        




      If you’re absent during my struggle,    
      don’t expect to be present during my success.