By migrating to Google Cloud, Scotiabank aims to unlock next-generation analytics capabilities while maintaining the integrity and performance of their existing Hadoop infrastructure during the transition.
The bank couldn’t take a big bang approach to their cloud aspirations, as analysis of every application, Spark workload and map reduce job was required to determine priorities and to evaluate whether any remediation was required when running on a cloud infrastructure. A phased approach was required to systematically move the artifacts into Google. The bank had initially looked at the DistCp utility to assist them in moving data sets into the cloud, however as DistCp runs as a standard MapReduce job competing for resources with other important workloads, it was felt that this additional overhead would not be acceptable in an infrastructure already struggling with compute capacity. The DistCp solution also required multiple executions to capture ongoing data changes which again added to the resourcing overload.
Cirata Data migrator was chosen as the primary transfer tool for the following reasons:
Cirata Data Migrator enabled Scotiabank to: