Simple Testing Can Prevent Most Critical Failures: An Analysis of Production Failures in Distributed Data-Intensive Systems D Yuan, Y Luo, X Zhuang, GR Rodrigues, X Zhao, Y Zhang, PU Jain, ... 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2014 | 261 | 2014 |
lprof: A non-intrusive request flow profiler for distributed systems X Zhao, Y Zhang, D Lion, MF Ullah, Y Luo, D Yuan, M Stumm 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2014 | 181 | 2014 |
Pensieve: Non-intrusive failure reproduction for distributed systems using the event chaining approach Y Zhang, S Makarov, X Ren, D Lion, D Yuan Proceedings of the 26th Symposium on Operating Systems Principles, 19-33, 2017 | 52 | 2017 |
The inflection point hypothesis: a principled debugging approach for locating the root cause of a failure Y Zhang, K Rodrigues, Y Luo, M Stumm, D Yuan Proceedings of the 27th ACM Symposium on Operating Systems Principles, 131-146, 2019 | 32 | 2019 |
Understanding and detecting software upgrade failures in distributed systems Y Zhang, J Yang, Z Jin, U Sethi, K Rodrigues, S Lu, D Yuan Proceedings of the ACM SIGOPS 28th Symposium on Operating Systems Principles …, 2021 | 25 | 2021 |
Systems and processes for computer log analysis M Faizanullah, L David, Y Luo, M Stumm, D Yuan, X Zhao, Y Zhang US Patent 9,729,671, 2017 | 15 | 2017 |
Efficiently detecting concurrency bugs in persistent memory programs Z Chen, Y Hua, Y Zhang, L Ding Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 8 | 2022 |
SAC: exploiting stable set model to enhance cacheFiles JL Liu, YL Zhang, L Yang, MY Guo, ZJ Liu, L Xu Journal of Computer Science and Technology 29 (2), 293-302, 2014 | 5 | 2014 |
Stable set model based methods for large-capacity client cache management M Guo, L Liu, Y Zhang, Z Liu, L Xu 2012 IEEE 14th International Conference on High Performance Computing and …, 2012 | 3 | 2012 |
Systems and processes for computer log analysis M Faizanullah, L David, Y Luo, M Stumm, D Yuan, X Zhao, Y Zhang US Patent 10,484,506, 2019 | 1 | 2019 |
Automatic Failure Diagnosis for Distributed Systems Y Zhang University of Toronto (Canada), 2021 | | 2021 |