Launching Data Carpentry: Image Processing with Python
I am thrilled to announce the first official release of the new Data Carpentry curriculum, Image Processing with Python. In November 2022, the Curriculum Advisory Committee met and approved a motion to move the lesson out of beta testing and into stable status. Version 1.0.0 of the curriculum was published to Zenodo last week. This means the curriculum is now available for teaching in official Data Carpentry workshops.
Curriculum Content
The Image Processing with Python curriculum teaches the concepts and skills needed to write automated processing pipelines for image data. Using examples from morphometrics, the curriculum covers the steps of a typical image processing workflow: loading images, masking, blurring and thresholding data, and ends with a capstone challenge to provide a chance for additional practice. Like Foundations of Astronomical Data Science, released in April 2022, this curriculum consists on a single lesson, designed to be taught over two full days or four half-days.
The curriculum uses a variety of example images that do not require any domain-specific knowledge. However, it does require learners to have gained some familiarity with the Python programming language, such as by attending a Software Carpentry, Data Carpentry: Ecology or Data Carpentry: Social Sciences workshop.
Want to Teach the Curriculum?
Image Processing with Python can now be taught in self-organised and
centrally-organised Data Carpentry workshops.
Set up your own workshop, or make sure you are subscribed to
the instructors
TopicBox list
to hear about opportunities to teach the lesson.
Some of the lesson Maintainers hosted an Instructor Onboarding session at CarpentryCon@Home 2022, which you can watch on The Carpentries YouTube channel.
Host a Data Carpentry: Image Processing Workshop
You can now request a workshop teaching the Image Processing with Python curriculum from The Carpentries, or organise and host your own. We are expecting high demand for these workshops, and Centrally-Organised workshops (those administered by The Carpentries) will initially be limited in number while we assess the Instructor community’s capacity for teaching this new curriculum. The Workshop Administrator will work with every request on a case by case basis.
Curriculum Background and Acknowledgments
The lesson was originally developed for
the Digital Imaging and Vision Applications in Science (DIVAS) project,
where it was taught to undergraduates majoring in natural science.
This initial lesson design and development was led by
Prof. Mark Meysenburg, Prof. Tessa Durham Brooks, Dr. Raychelle Burks, Dr. Erin Doyle and colleagues
at Doane University, Nebraska, USA,
and used cv2
, the Python interface to OpenCV,
as its main toolkit for image processing.
In an effort led by Dominik Kutra and Dr. Constantin Pape of the European Molecular Biology Laboratory (EMBL), the lesson was then converted to use scikit-image as the primary library, and was taught in alpha pilots at EMBL and The University of Arizona.
In June 2021, Dr. Kimberly Meechan, David Palmquist, Prof. Ulf Schiller, and Dr. Robert Turner answered a call for volunteers to help make these changes to the curriculum, and this group has done excellent work to prepare the lesson for a beta release.
In May 2022, we published a call for beta pilots to help polish the curriculum and ensure it was ready to be taught by The Carpentries global Instructor community. Answering this call, the University of Würzburg and the Earlham Institute hosted beta pilot workshops and provided feedback and improvements to the lesson repository.
The first Curriculum Advisory Committee for Data Carpentry: Image Processing was established shortly after that call for beta pilots was published, and includes members with experience and expertise in image processing from a wide range of domains. The members of the CAC are:
- Ulf Schiller (Chair)
- Josh Moore (Secretary)
- Marianne Corvellec
- Erick Martins Ratamero
- Sunil Shende
The current Maintainer team is:
It has been a long road for this curriculum to get to this point, where we can finally make a stable release and share it with the Instructor community. I am so grateful to all the people mentioned above, and to the many other community members who have contributed to the project along the way. If you have taught the lesson, hosted a workshop for the curriculum, opened an issue or pull request in the repository, or shared the lesson site with a friend or colleague, thank you! Additional thanks go to my colleagues in the Curriculum Team, Erin Becker and Zhian Kamvar, for the role they have played in the development of the curriculum and the preparations for its release. I am excited to see what the Data Carpentry community does with the curriculum from here, and how it evolves in the coming years.