Scenario

Enterprises running self-built Apache Hive data warehouse systems are seeking to migrate workloads to Alibaba Cloud E-MapReduce (EMR). A typical example is a Hive data warehouse deployed in a Hadoop cluster that runs over HDFS storage. The Hadoop cluster is deployed either on premises or on a cloud-based virtual machine, and is used for the extract, transform, load (ETL) process.

This solution helps you migrate self-built Hadoop cluster and Hive system to Alibaba Cloud EMR. This solution provides an affordable yet secure network for data migration over a VPN tunnel; enables you to host your Hadoop and Hive workloads over easy-to-deploy infrastructure resources; frees you from the maintenance of infrastructure when you use this managed service; allows you to directly perform analytics on the data using supported services, such as Hadoop, Hive, Spark, and Flink; and provides advanced identity and authorization management features.