GoDaddy Inc. is an American publicly traded Internet domain registry, domain registrar and web hosting company headquartered in Tempe, Arizona. As of 2023, GoDaddy is the world's fifth largest web host by market share, with over 62 million registered domains. The company has around 21 million customers and over 6,900 employees worldwide.

GoDaddy migrates very highly used Hadoop cluster to Amazon S3 with zero business disruption

At a glance

Solution
Data Management
Customer:
GoDaddy
Customer Size:
Corporation (8,000+ employees)
Industry:
Internet hosting
Location:
Tempe, Arizona
Website:
www.godaddy.com
Background:

GoDaddy utilizes an 800-node Apache Hadoop cluster to hold over 2.5 petabytes of customer-related activity and behavior data. This on-premises data lake is critical for guiding business operations and determining the company’s investment strategies. The system is in operation 24x7. It can generate peak loads of more than 100,000 file system events per second, with sustained 12 hour periods processing an average of over 21,000 change operations every second.

Objective:

While the on-premises data lake is business critical, it is aging and running on an old version of Apache Hadoop (2.8). GoDaddy wanted to modernize the implementation by migrating the data to Amazon Web Services (AWS) to take advantage of the modern tooling and analytics capabilities available on AWS, and mitigating the risks and costs associated with maintaining the on-premises Hadoop cluster and the underlying hardware.

Challenge:

The challenge for GoDaddy was how to migrate petabytes of actively changing, “live” data when the business depends on the continued operation of applications in the cluster and access to its data. Any disruption to business operations would be unacceptable and may have prevented a migration from even being attempted.

Solution:

GoDaddy, being a technically oriented company with deep software development skills, often builds their own solutions. As such, they investigated building their own custom migration solution leveraging open source tools. However, it was deemed that performing the initial migration and ongoing synchronization manually is a complex, error-prone task, and not the core competency on which they wanted their highly skilled engineers to spend their time. Instead, following a quick demonstration of a 2TB migration, and a subsequent 10TB proof-of-concept GoDaddy selected Cirata Data Migrator to automate the migration. Data Migrator combines a single scan of the source datasets with processing of the ongoing changes that occur to achieve a complete and continuous data migration. It does not impose any cluster downtime or disruption, and requires no changes to cluster operation or application behavior.

Results:
  • Using Data Migrator, GoDaddy achieved their initial migration goal—to migrate 500TB (over 8.6 million files) of the 2.5PB to AWS S3.
  • Completed the migration process while maintaining normal business operations at all times.
  • Reduced cost and risk of custom data migration development, enabling engineers to focus on other business-critical tasks.
  • Established a new environment using AWS where GoDaddy plans to leverage AWS S3, EMR, Athena and other AWS services to achieve the following:
    • Lower risk by moving off current aging hardware.
    • Meet SLAs for critical ETL processing requirements.
    • Create a better experience for their users through faster queries.
    • Greater agility by putting more data and flexible compute in the hands of data consumers.
    • Improved operational efficiency by alleviating the burden of managing the large and complex on-premises hardware and software infrastructure.
Quote:

“At GoDaddy, deep technical knowledge is in our DNA, and we often build applications in-house to support growth. In the use case of a Hadoop to Amazon S3 data migration and replication, we found Cirata’s Data Migrator to be the optimal approach to deliver the best time to value, rather than running a more time consuming and costly manual migration project internally.”
– Wayne Peacock, Chief Data dnd Analytics Officer, Godaddy

Ready to see your own success story unfold?

Just like we've helped GoDaddy migrate their Hadoop data cluster to Amazon S3, we're here to do the same for you. Reach out today and let's explore how Cirata Data Migrator can transform your data goals.

Cookies and Privacy

We use technology on our website to collect information that helps us enhance your experience and understand what information is most useful to visitors.
By clicking “I ACCEPT,” you agree to the terms of our privacy policy.

Cookie Setting