Dask is an open-source library for parallel computing in Python that interoperates with existing Python data science libraries like Numpy, Pandas, Scikit-Learn, and Jupyter.  Dask is used today across many different scientific domains.

Recently, we’ve observed an increase in use in a few life sciences applications:

  1. Large scale imaging in microscopy
  2. Single cell analysis
  3. Genomics

We see early adopters in these fields use the Dask array library along with other libraries and file formats specific to their discipline.

We would like to accelerate growth of Dask in these fields by contracting someone to assist with building out functionality for these early adopter groups, and then communicate that functionality more broadly to a wide audience.

Role

You will be tasked with bridging the gap between life science groups and the broader Dask and PyData development communities.  This will require a variety of both technical and communication activities, including sourcing requirements from several different science groups, contributing to open source software, and coaching science groups to build proof-of-concept computations.  Additionally, you will disseminate these results through a variety of communication channels like blogs, webinars, and on-line trainings to downstream science communities.

Requirements

These are not hard requirements – if you feel you fulfill most of these criteria, please do apply! We know we’re not perfect communicators, so think of this as the starting point for discussion.

We’re hoping to find someone who has experience with … :

  • Array computing using Numpy and the PyData stack
  • The life sciences
  • Maintaining community open source software (git, testing, communication, and patience)
  • Communication skills, both in a remote work environment and with the broader scientific public

Additionally, it would be nice to have experience with …

  • Dask
  • Teaching or running workshops
  • Working in a fully remote environment

Logistics

This is a one-year contract position funded by the Chan-Zuckerberg Initiative’s Essential Open Source Software program.  It is a fully remote position.  Your contract will be with NumFOCUS, a US-based non-profit, but you will work with Dask community members and life sciences practitioners across several different institutions.  There are no residency requirements.

Candidates with diverse backgrounds are especially encouraged to apply.

To Apply, please send your cover letter and resume to with the subject line: “Dask Life Sciences Fellow”