Lessons from Systems Testing

Quality assurance professionals have long dealt with the challenges of testing whole systems, where determinism and isolation are not always possible because the tests must reflect the real-world conditions those systems face. What lessons can we learn from this expertise?

For example, most real, “enterprise-grade” applications today are distributed systems in which services are invoked asynchronously, sometimes across different cloud environments, and the application has to orchestrate the invocations and process the responses. Any distributed system is inherently nondeterministic, because concurrent activities in real-world systems don’t have deterministic response times. As one example, network congestion between distributed services means that responses won’t arrive in a predictable order.
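To make the ordering problem concrete, here is a minimal sketch in Python. It assumes three hypothetical services (`inventory`, `pricing`, `shipping`) whose latency is simulated with random sleeps. The names `call_service`, `fan_out`, and `check_responses` are invented for illustration and do not come from any real system. The point is that a test can assert properties that hold for every arrival order, rather than asserting one particular order.

```python
import asyncio
import random


async def call_service(name: str, rng: random.Random) -> str:
    # Hypothetical service call: the random sleep stands in for network latency.
    await asyncio.sleep(rng.uniform(0.0, 0.01))
    return f"{name}:ok"


async def fan_out(names: list[str]) -> list[str]:
    # Invoke all services concurrently and record responses in arrival order.
    rng = random.Random()
    tasks = [asyncio.create_task(call_service(n, rng)) for n in names]
    arrivals = []
    for finished in asyncio.as_completed(tasks):
        arrivals.append(await finished)
    return arrivals


def check_responses(names: list[str], responses: list[str]) -> bool:
    # Order-independent check: every service answered exactly once.
    return sorted(responses) == sorted(f"{n}:ok" for n in names)


if __name__ == "__main__":
    services = ["inventory", "pricing", "shipping"]
    result = asyncio.run(fan_out(services))
    print(result)  # arrival order may differ from run to run
    print(check_responses(services, result))
```

Comparing `result` directly against a fixed list would make this test flaky, while the order-independent check passes on every run.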

Fortunately, these kinds of nondeterminism are bounded and well understood, and the techniques for handling them are mature, standard practice for experienced teams. So, the first lesson for us takes the form of a question: how can we quantify and bound the possible results from the models we use?
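One possible way to approach that question, sketched here under stated assumptions rather than as an established method, is to treat a check on a model's output as a repeated trial. You measure the pass rate over many runs, then compare a conservative lower bound on that rate to a required threshold. In the sketch below, `fake_model` is a hypothetical stand-in for a real model call, the 0.95 success probability and 0.90 threshold are arbitrary illustrative values, and the Wilson score interval is just one of several reasonable choices for the bound.

```python
import math
import random
from typing import Callable


def pass_rate(trial: Callable[[], bool], runs: int) -> float:
    # Fraction of runs in which the trial's check passed.
    passes = sum(1 for _ in range(runs) if trial())
    return passes / runs


def wilson_lower_bound(rate: float, runs: int, z: float = 1.96) -> float:
    # Lower end of the Wilson score interval (z = 1.96 is roughly 95% confidence).
    denom = 1 + z * z / runs
    centre = rate + z * z / (2 * runs)
    margin = z * math.sqrt(rate * (1 - rate) / runs + z * z / (4 * runs * runs))
    return (centre - margin) / denom


def fake_model(rng: random.Random) -> str:
    # Hypothetical stand-in for a nondeterministic model call.
    return "yes" if rng.random() < 0.95 else "unsure"


if __name__ == "__main__":
    rng = random.Random(0)  # seeded only so the demo is repeatable
    runs = 1000
    rate = pass_rate(lambda: fake_model(rng) == "yes", runs)
    bound = wilson_lower_bound(rate, runs)
    print(f"pass rate {rate:.3f}, lower bound {bound:.3f}")
    # Accept only if the conservative bound clears the (illustrative) threshold.
    print("accept" if bound >= 0.90 else "reject")
```

Comparing the lower bound to the threshold, not the raw pass rate, makes the test less likely to pass by luck on a small number of runs.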

TODOs:

Explore more general concepts and techniques from this community.
Provide specific examples of how to use those concepts.