Sora: A Latency Sensitive Approach for Microservice Soft Resource Adaptation

Jianshu Liu, Qingyang Wang, Shungeng Zhang, Liting Hu, Dilma Da Silva

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citations

Abstract

Fast response times for modern web services composed of numerous distributed, lightweight microservices are becoming increasingly important because of their business impact. While hardware-only resource scaling approaches (e.g., FIRM [47] and PARSLO [40]) have been proposed to mitigate response time fluctuations on critical microservices, the re-adaptation of soft resources (e.g., threads or connections) that control the concurrency of hardware resource usage has been largely ignored. This paper shows that the soft resource adaptation of critical microservices has a significant impact on system scalability because either under- or over-allocation of soft resources can lead to inefficient usage of the underlying hardware resources. We present Sora, an intelligent, fast soft resource adaptation management framework that quickly identifies and adjusts the optimal concurrency level of critical microservices to mitigate service-level objective (SLO) violations. Sora leverages online fine-grained system metrics and the propagated deadline along the critical path of request execution to quickly and accurately provide optimal concurrency settings for critical microservices. Based on six real-world bursty workload traces and two representative microservices benchmarks (Sock Shop and Social Network), our experimental results show that Sora can effectively mitigate large response time fluctuations and reduce the 99th percentile latency by up to 2.5× compared to the hardware-only scaling strategy FIRM [47] and 1.5× compared to the state-of-the-art concurrency-aware system scaling strategy ConScale.
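The claim that both under- and over-allocation of soft resources hurt efficiency can be illustrated with a toy model. The sketch below is not Sora's algorithm: it swaps in the Universal Scalability Law (USL) for throughput and Little's Law for mean latency, and the parameter values (`lam`, `sigma`, `kappa`) and function names are hypothetical, chosen only for illustration. It also reasons about mean latency rather than the 99th percentile reported in the abstract.

```python
def usl_throughput(n, lam=100.0, sigma=0.05, kappa=0.001):
    """Throughput (req/s) at concurrency n under the Universal Scalability Law.

    lam   : throughput of a single thread (hypothetical value)
    sigma : contention penalty (hypothetical value)
    kappa : coherency/crosstalk penalty (hypothetical value)
    """
    return lam * n / (1 + sigma * (n - 1) + kappa * n * (n - 1))


def mean_latency(n, **params):
    """Mean latency in seconds via Little's Law: concurrency / throughput."""
    return n / usl_throughput(n, **params)


def best_concurrency(deadline_s, max_n=200, **params):
    """Pick the concurrency with the highest modeled throughput among
    those whose modeled mean latency stays within the deadline.

    Returns None when no concurrency level meets the deadline.
    """
    feasible = [n for n in range(1, max_n + 1)
                if mean_latency(n, **params) <= deadline_s]
    if not feasible:
        return None
    return max(feasible, key=lambda n: usl_throughput(n, **params))


if __name__ == "__main__":
    # Too few threads under-use the hardware; too many add contention.
    for n in (10, 31, 100):
        print(n, round(usl_throughput(n), 1), round(mean_latency(n), 4))
    # A tighter deadline forces a lower concurrency setting.
    print(best_concurrency(0.1), best_concurrency(0.02))
```

In this model throughput rises with concurrency, peaks, and then declines, so a fixed thread or connection pool size that is right at one load level can be wrong at another. A deadline budget for a given microservice further caps the usable concurrency, which is the kind of trade-off an adaptive controller has to resolve online.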

Original language: English (US)
Title of host publication: Middleware 2023 - Proceedings of the 24th ACM/IFIP International Middleware Conference
Publisher: Association for Computing Machinery, Inc
Pages: 43-56
Number of pages: 14
ISBN (Electronic): 9798400701771
DOIs
State: Published - Nov 27 2023
Event: 24th ACM/IFIP International Middleware Conference, Middleware 2023 - Bologna, Italy
Duration: Dec 11 2023 → Dec 15 2023

Publication series

Name: Middleware 2023 - Proceedings of the 24th ACM/IFIP International Middleware Conference

Conference

Conference: 24th ACM/IFIP International Middleware Conference, Middleware 2023
Country/Territory: Italy
City: Bologna
Period: 12/11/23 → 12/15/23

Keywords

  • Auto-scaling
  • Microservices
  • Scalability
  • Soft Resource

ASJC Scopus subject areas

  • Information Systems
  • Software
