Tuesday, Apr 7, 2026 · 6:30 PM – 8:00 PM

Architecting Distributed Systems for Failure, Adaptation, and Long Term Evolution

Kirill Andreev
Principal Backend Engineer

About the talk

Modern backend systems are no longer judged only by throughput or uptime. They are judged by how well they absorb failure, how safely they evolve under continuous product change, and how effectively they support teams working across increasingly complex platform environments.

This session will examine the architectural principles behind distributed systems that remain dependable under stress and sustainable over time. Rather than treating outages, degraded dependencies, and conflicting consistency requirements as edge cases, the talk will focus on designing systems in which these conditions are expected and systematically accounted for.

The discussion will look at how experienced engineering teams approach partial failure, retries, timeout strategy, idempotent operations, asynchronous workflows, and service boundaries. It will also address the longer term challenge of platform evolution, including how to avoid repeated large scale rewrites, reduce architectural fragility, and create systems that can adapt as product, traffic, and organisational demands change.

Using real engineering patterns and production oriented thinking, the session will offer a practical view of how backend architecture can move beyond short term scalability and towards resilience, maintainability, and stronger technical standards across the industry.

This talk will cover

• Designing for partial failure in distributed environments
• Timeout, retry, and idempotency strategies in production systems
• Service boundaries and dependency management at scale
• Trade offs between consistency, availability, and operational simplicity
• Architectural patterns that support long term system evolution

About the speaker

Kirill Andreev

Kirill Andreev is a senior backend engineer with more than twelve years of experience building high-load platforms, internal infrastructure, and distributed services for digital products. His work has focused on system reliability, platform design, and backend architecture that supports both product growth and engineering sustainability. He is particularly interested in how production constraints shape better technical decisions and how strong backend practices can raise standards across the wider technology community.

LinkedIn