Shi Jin

Shi Jin

Senior SDE at AWS Annapurna Labs
PhD in Physics, UW


AWS Annapurna Labs

University of Washington

Seattle, WA

sjina@amazon.com

kingstone1991@gmail.com

shijin-aws

Google Scholar

About

I am a Senior Software Development Engineer at AWS Annapurna Labs, working on high-performance communication libraries built on top of the AWS Elastic Fabric Adapter (EFA) — a custom RDMA-capable NIC designed for tightly-coupled HPC and large-scale ML workloads.

My primary focus is libfabric (OFI), the open-source network fabric API that provides portable access to high-speed transports including EFA, InfiniBand, and RoCE. I also contribute significantly to Open MPI for HPC collective communication. Occasionally I collaborate on related projects such as aws-ofi-nccl, NIXL, and UCCL.

Before joining AWS, I obtained a PhD in physics from the University of Washington. During my graduate years, working with Prof. Aurel Bulgac, I studied quantum many-body problems in nuclear physics. My research primarily involved the application of density functional theory (DFT) and its time-dependent extension (TDDFT) on superfluid many-fermion systems — from static nuclear structure to nuclear fission and reactions.

I have expertise in high-performance computing, including MPI, OpenMP, and CUDA C programming, and a solid background in applied mathematics, especially the numerical solution of partial differential equations (PDEs).


I was born in 1991 in Wuhu, Anhui, China. I received a B.S. in physics from the University of Science and Technology of China (USTC) in 2013 before joining the graduate program at the University of Washington.


Open Source Projects

libfabric Primary

Open Fabric Interfaces — a high-performance networking API providing portable access to RDMA transports including AWS EFA, InfiniBand, and RoCE. My main open source project.

797 506 C
Open MPI Primary

The standard open-source MPI implementation for HPC. I contribute to collective communication and EFA transport support.

A plugin bridging NCCL and libfabric, enabling distributed deep learning training on AWS EFA.

216 99 C
NIXL  /  UCCL

Occasional collaborations on next-generation GPU communication: NIXL for AI inference data transfer, UCCL for flexible GPU collective communication.

1073 / 1403 C++/CUDA

Latest News

2023-10-26 Blog
EFA: how fixing one thing, led to an improvement for everyone — AWS HPC Blog (with Brendan Bouffler).
2021-04-06 Article
2020-03-04 Article
2019-09-19 Article
2019-07-24 Article
2019-05-30 Final Exam
Defended my PhD thesis. Now I am Dr. Jin!
2019-03-10 Conference
Attended the CENTAUR annual review at Texas A&M University.
2018-06-04 Conference
Attended the 2018 OLCF GPU Hackathon, Boulder, CO.
2018-04-14 Conference
Gave talk "Fission dynamics within TDDFT" at APS April Meeting, Columbus, OH.
2018-03-12 Article
2018-02-21 Workshop
Attended SSAP Symposium 2018, Rockville, MD.
2017-09-17 School
Attended LANL FIESTA fission school, Santa Fe, NM.
2017-07-31 Article
2017-07-11 Exam
Passed my general exam. Now a PhD candidate!
2017-06-06 Seminar
Gave seminar "Induced fission of 240Pu: sampling from configuration space" at NTG/INT Brown Bag, UW.
2017-04-03 Article