Postdoctoral Appointee - Large Scale Data Management and Storage for HPC/AI
Job posting number: #7121970 (Ref:414746)
Posted: January 24, 2023
Application Deadline: Open Until Filled
We are looking for highly skilled and motivated Postdoctoral Appointees to join our efforts around large-scale data storage and management for distributed computing systems and in particular HPC infrastructures. In this context, scientific instruments and hybrid HPC workloads (that combine simulations, big data analytics, and AI) generate, store and access huge amounts of data. You will be at the forefront of addressing major challenges and research questions in the design and development of runtimes that enable data management in a scalable, high-performance and resource-efficient fashion.
- We are interested in aspects: of caching, prefetching, versioning and lineage of data, checkpointing and intermediate data snapshots, incremental storage and evolution of data, placement on heterogeneous storage, etc.
- We are building on our experience with data models that capture the evolution of distributed datasets into a searchable lineage of snapshots that enable efficient storage and revisiting (DataStates) and well as checkpoint-restart systems (VELOC).
- We make use of composable services that bring flexibility (typically missing in HPC) around communication, concurrency management and building blocks such as BLOB and key-value stores (Mochi).
- We apply such data management techniques and principles in a variety of scenarios: AI network architecture search based on transfer learning, optimized data pipeline for large-scale AI training, AI model repositories with fine-grain incremental tensor storage and access, reducing I/O overheads of adjoint computations, reproducibility of workflows, etc.
In addition to addressing such transformative challenges,
- You will have the opportunity to get involved in several other efforts at the intersection of HPC, machine learning and big data analytics.
- We work closely with many domain experts to identify the requirements and bottlenecks of real-life scientific applications that address the needs of our society over the next decades.
- You will be part of a vibrant and diverse research community from more than 100 countries.
- Our lab hosts Aurora, one of the first Exascale supercomputers in the world, which you will have an opportunity to use for your experiments.
- In addition, you will have access to a large array of leading-edge experimental testbeds through the Joint Laboratory for System Evaluation (JLSE), which feature the latest technologies from top vendors like Intel, NVIDIA, AMD, etc.
- A recent or soon-to-be completed PhD degree (typically within the last 0-3 years)
- Familiarity with data management techniques: caching, indexing, asynchronous I/O
- Ability to conduct interdisciplinary research and participate in teamwork and broad collaborative efforts involving other laboratories and universities, supercomputer centers and industry
- Ability to model Argonne's core values: impact, respect, integrity, teamwork and safety
- Scientific background in distributed computing and HPC including:
- Strong code development skills with C/C++ and Python
- Familiarity with modern data management and I/O best practices
Job FamilyPostdoctoral Family
Job ProfilePostdoctoral Appointee
Worker TypeLong-Term (Fixed Term)
Time TypeFull time
As an equal employment opportunity and affirmative action employer, and in accordance with our core values of impact, safety, respect, integrity and teamwork, Argonne National Laboratory is committed to a diverse and inclusive workplace that fosters collaborative scientific discovery and innovation. In support of this commitment, Argonne encourages minorities, women, veterans and individuals with disabilities to apply for employment. Argonne considers all qualified applicants for employment without regard to age, ancestry, citizenship status, color, disability, gender, gender identity, gender expression, genetic information, marital status, national origin, pregnancy, race, religion, sexual orientation, veteran status or any other characteristic protected by law.
Argonne employees, and certain guest researchers and contractors, are subject to particular restrictions related to participation in Foreign Government Sponsored or Affiliated Activities, as defined and detailed in United States Department of Energy Order 486.1A. You will be asked to disclose any such participation in the application phase for review by Argonne's Legal Department.
All Argonne offers of employment are contingent upon a background check that includes an assessment of criminal conviction history conducted on an individualized and case-by-case basis. Please be advised that Argonne positions require upon hire (or may require in the future) for the individual be to obtain a government access authorization that involves additional background check requirements. Failure to obtain or maintain such government access authorization could result in the withdrawal of a job offer or future termination of employment.
Please note that all Argonne employees are required to be vaccinated against COVID-19. All successful applicants will be required to provide their COVID-19 vaccination verification as a condition of employment, subject to limited legally recognized exemptions to COVID-19 vaccination.
Argonne is an equal opportunity employer, and we value diversity in our workforce. As an equal employment opportunity and affirmative action employer, Argonne National Laboratory is committed to a diverse and inclusive workplace that fosters collaborative scientific discovery and innovation. In support of this commitment, Argonne prohibits discrimination or harassment based on an individual's age, ancestry, citizenship status, color, disability, gender, gender identity, genetic information, marital status, national origin, pregnancy, race, religion, sexual orientation, veteran status or any other characteristic protected by law.