You can teach yourself with the materials for all of our regular classes and workshops.
Browse our classes and workshops:
Data Science Programming (e.g., Intro R, Intermediate Python, Intro to SQL)
Scalable Computing (e.g., Intro to Command Line, Cluster 101)
Reproducible Research (e.g., Intro to Git and GitHub)
Data 4 All (e.g., Better Spreadsheets, Better Tables, Better Plots)
These courses are designed for self-study; there is no corresponding class or workshop. But you should still feel free to reach out to us with any questions!
Short tutorials for Python and R that run in your browser, covering cool packages that help you work with data effectively.
Learn cool Python and R tricks for working with data.
Workflows help bioinformaticians analyze data efficiently on the cluster. They are better than bash scripts! Learn about Workflow Development Language (WDL), a better alternative to SnakeMake.
Analyze data more efficiently with workflows
We offer three self-directed courses on how to write workflows in Workflow Description Language (WDL). We prefer WDL to SnakeMake -- ask us why!
Labs work better when labmates do code review. Code review is essentially proofreading each others’ code.
Code review helps labs prevent errors together
Leading a lab with novice or experienced codewriters or users? Either way, check out our Code Review materials. We have tips on best practices for code review for labs of varying experience.
NIH Wants you to share your data because it is better for science. But you need to keep some information confidential! What do you do?
Learn how to comply with NIH data sharing policies
We have created a gudie that walks you through the process of complying with the NIH data sharing policy of 2023. We have also created a helper app.
DaSL’s affiliate scientists create a variety of training materials geared more broadly towards the biomedical research community, and designed to allow you to teach yourself.
Explore all DaSL and DaSL affiliate course materialsAnVIL is a computing platform that enables researchers to analyze controlled access data sets in a cloud computing environment. Our training materials will help you use AnVIL for data exploration and analysis on various bioinformatics topics. We also provide teaching support materials.
The DataTrail courses are free and designed to help those with less familiarity with computers and technology become savvy data scientists. It includes the technological data science fundamentals but also information on how to network and other accompanying and necessary skills for jobs in data science.
The ITCR Training Network is an effort to catalyze cancer informatics research through training opportunities. It has online courses that are available for free and/or for certification, but also hosts synchronous training events and workshops related to data science in cancer research. Links to all the current ITCR courses can be found here.
The BioDIGS materials are described here in several lines of blurb.The activities in this guide are written for undergraduate students and beginning graduate students. Some sections require basic understanding of the R programming language, which is indicated at the beginning of the chapter.