DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Refcards Trend Reports Events Over 2 million developers have joined DZone. Join Today! Thanks for visiting DZone today,
Edit Profile Manage Email Subscriptions Moderation Admin Console How to Post to DZone Article Submission Guidelines
View Profile
Sign Out
Refcards
Trend Reports
Events
Zones
Culture and Methodologies Agile Career Development Methodologies Team Management
Data Engineering AI/ML Big Data Data Databases IoT
Software Design and Architecture Cloud Architecture Containers Integration Microservices Performance Security
Coding Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks
Partner Zones AWS Cloud
by AWS Developer Relations
Culture and Methodologies
Agile Career Development Methodologies Team Management
Data Engineering
AI/ML Big Data Data Databases IoT
Software Design and Architecture
Cloud Architecture Containers Integration Microservices Performance Security
Coding
Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance
Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks
Partner Zones
AWS Cloud
by AWS Developer Relations
  1. DZone
  2. Software Design and Architecture
  3. Performance
  4. Ana Medina at Reactive Summit: “Resiliency Is Extremely Hard, Especially When Running Anything at Scale.”

Ana Medina at Reactive Summit: “Resiliency Is Extremely Hard, Especially When Running Anything at Scale.”

Ana Medina, chaos engineer at Gremlin, talks about chaos engineering and cloud computing ahead of her talk at Reactive Summit.

Kathleen Hayes user avatar by
Kathleen Hayes
·
Sep. 18, 18 · Interview
Like (1)
Save
Tweet
Share
3.90K Views

Join the DZone community and get the full member experience.

Join For Free
“In all chaos, there is a cosmos, in all disorder a secret order.”
-Carl Jung

A Chaos Engineer at Gremlin, Ana helps companies avoid outages by running proactive chaos engineering experiments. At Reactive Summit in Montreal this October, Ana offers a deep-dive into the world of chaos engineering with her talk“The Practice of Chaos Engineering.”

In anticipation of Ana’s talk at Reactive Summit, we spoke to Ana about her developer journey, the importance of reliability and how breaking things on purpose can help organizations build more reliable systems.

What is your background and what sparked your interest in chaos engineering and cloud computing?

I began writing code at the age of 13. There was something fun about building things out of nothing, and it enabled me to teach myself a few languages early on. I did a lot of work on web and mobile development and then transitioned to the Infrastructure world. I stumbled upon Cloud Computing and Chaos Engineering when I first started working at Uber on their Site Reliability Engineering team focusing on their Chaos Engineering tool, uDestroy. I then transferred to Uber’s Cloud Infrastructure team where I worked on building a tool for bringing Uber to the Cloud using GCP and AWS. I rapidly learned that working on internal infrastructure tools was interesting to me. I found the value in building services, especially when it meant keeping a company like Uber, with thousands of microservices, reliable. I’m now working at Gremlin as a Chaos Engineer, helping companies avoid downtime by proactively running chaos engineering experiments.

What problems do you solve as a part of your job?

Working at a small startup means the type of problems I work on are constantly changing. As a pioneer in the chaos engineering space, I get to help others see the ways their infrastructure can break at any moment, so they should break it on purpose to build more resilient systems. My favorite part of my job is being able to learn about the infrastructure of different companies, their struggles with the scalability of their microservices.

Reactive is a new buzzword for many traditional developers. What is your prediction for its importance in application development over the next couple of years?

I strongly believe in making reliability one of the first core things to think about when starting to develop a service, application or company. Reactive makes that a priority and I believe reactive development is only going to get more prominent in companies.

What is the biggest challenge companies deploying distributed Reactive systems are facing?

I’m a bit biased in this question as I spend most of my time helping others learn how to use chaos engineering to build more resilient systems. Resiliency is extremely hard, especially when in microservice architectures or when running anything at scale. The last thing any company wants is to have any sort of downtime or failure in their systems. A multi-day outage is unacceptable.

What is the best solution to this challenge?

Break things on purpose is what we like to say Gremlin. Chaos Engineering is the practice of thoughtfully planned experiments designed to reveal the weakness in our systems. There are too many points of failures in applications these days, but you can inject chaos at any layer.

What is your most ambitious professional dream that you hope to achieve one day?

I’ve had a few, but I think the most ambitious one is to one day become president of Costa Rica.

Who should attend your talk and what will they learn?

My talk is open to engineers of all levels, though beginners and intermediate engineers would enjoy it best. They will learn what chaos engineering is, why it’s important and how to get started.

With whom would you like to connect at the conference?

Everyone and anyone! It’s my first time interacting with the Reactive Systems world and I feel like it’s a good conference to network and learn.

Chaos engineering

Published at DZone with permission of Kathleen Hayes. See the original article here.

Opinions expressed by DZone contributors are their own.

Popular on DZone

  • Asynchronous Messaging Service
  • mTLS Everywere
  • Microservices 101: Transactional Outbox and Inbox
  • Is DevOps Dead?

Comments

Partner Resources

X

ABOUT US

  • About DZone
  • Send feedback
  • Careers
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 600 Park Offices Drive
  • Suite 300
  • Durham, NC 27709
  • support@dzone.com
  • +1 (919) 678-0300

Let's be friends: