Ariel Rokem
https://neuroinformatics.uw.edu
Department of Psychology
eScience Institute
Follow along at:
Allen Institute for Brain Science
Human Connectome Project (HCP), N = 1,200
Adolescent Brain Cognitive Development (ABCD),
N = 10,000
Healthy Brain Network (HBN), N = 10,000
UK Biobank, N = 500,000
New data sets will enable important new discoveries
Data aggregation and integration
Machine learning and data mining
Data visualization and communication
Insights into the brain basis of complex behaviors
Personalized medicine
Brain connections develop and mature with age
Individual differences account for differences in behaviour
Adapt and change with learning
Brain network health is important for mental health
The 3D structure of each brain is unique
The tracts are the coordinate frame for quantitative analysis
The tracts are the coordinate frame for quantitative analysis
The tracts are the coordinate frame for quantitative analysis
The tracts are the coordinate frame for quantitative analysis
The tracts are the coordinate frame for quantitative analysis
The tracts are the coordinate frame for quantitative analysis
The tracts are the coordinate frame for quantitative analysis
Neurodegenerative disease
Affects motor neurons
Etiology varies widely
Patient/Control?
Classification accuracy of 93% (+/- 2%)
AUC of 0.978 (+/- 0.01)
Allen Institute for Brain Science
Human Connectome Project (HCP), N = 1,200
Adolescent Brain Cognitive Development (ABCD),
N = 10,000
Healthy Brain Network (HBN), N = 10,000
UK Biobank, N = 500,000
On the one hand:
But on the other
Enabling technology
But:
Reap the benefits of AWS Batch without leaving our development environment
For example: Jupyter notebook running locally on our laptop
import cloudknot as ck
def awesome_func(...):
...
knot = ck.Knot(func=awesome_func)
import cloudknot as ck
def awesome_func(...):
...
knot = ck.Knot(func=awesome_func)
...
future = knot.map(args)
Compare to Dask, Myria, Spark using previous benchmark study
(Mehta et al., 2017).
import cloudknot
knot = cloudknot.Knot()
results = knot.map(sequence)
Github repo: https://github.com/nrdg/cloudknot
Documentation: https://nrdg.github.io/cloudknot/index.html
We welcome issues and contributions!
Results from large multi-dimensional datasets are hard to understand
Hard to communicate
Hard to reproduce
A web-based application
Leverages modern visualization frameworks
Builds a web-site for a diffusion MRI dataset
Automatically uploads the website to GitHub
Enhances published results
Linked visualizations facilitate easy exploration
Enables new discoveries in old datasets
Generates hypotheses for new research
MRI data analysis requires specific expertise
Tract segmentation and tractometry generates data in a tidy table format (CSV)
Facilitates interdisciplinary collaboration