Dataset

Background on the Dataset

The dataset that will be used for this hackathon is a stem cell differentiation to cardiomyocytes with samples collected every day during the differentiation starting at the day cells were plated (Day -2) and ending with cardiomyocytes (Day 14). Day 0 indicates the start of the differentiation. Samples were fixed on the day of collection and then processed together with combinatorial indexing using Scale Biosciences technology.

Experimental Design

This data contains three stem cells lines (FSA0024I1, IST2607K3, WAB0137L8) that were cultured using a village platform meaning that all lines were cultured in the same dish and demultiplexed following sequencing by matching the genetic information in the sequencing reads to genotype data for each cell line. Each line is well represented across the days of differentiation:

Experimental Design

Details about the data metadata

This dataset only has a few different metadata fields. This icludes:

  • nCount_RNA: Number of UMIs identified per cell
  • nFeature_RNA: Number of unique features (genes) expressed per cell
  • Barcode: Cell barcode
  • CellLine: The cell line that cell is from (FSA0024I1, IST2607K3 or WAB0137L8)
  • Day: The collection day each cell was collected and fixed
  • Sex: The sex of the donor that the iPSC line was generated from
orig.ident nCount_RNA nFeature_RNA Barcode CellLine Day
SeuratProject 6642.99981199999 3462 AAACCATAGGCAGGTCCGTAGGTCAGCTT WAB0137L8 Day-1
SeuratProject 17163.0002339001 6789 AAACCATAGGCAGGTCCGTTAAGTCCTGA WAB0137L8 Day-1
SeuratProject 11131.0002034 5248 AAACCATAGGCAGGTCCGTTCCGGCTTAT FSA0024I1 Day-1
SeuratProject 18820.0002576 6886 AAACCATAGTCAACGTAAGAGCCGTAGTT FSA0024I1 Day-1
SeuratProject 15492.0002927 6469 AAACTCCAAACGCGAGATTGTAGCAGCTA WAB0137L8 Day-1
SeuratProject 15993.0000964 6711 AAACTCCAAGCAGGTCCGTCCGCTAAGAG WAB0137L8 Day-1
SeuratProject 20849.0000901 7491 AAATCGTTCGCAGGTCCGTACGGCGTTAA WAB0137L8 Day-1
SeuratProject 41359.9987672 9909 AAATTCCTCACGCGAGATTGTAGGCTGCA WAB0137L8 Day-1
SeuratProject 23723.999869 8025 AAATTCCTCGCAGGTCCGTTATTGCTGGA WAB0137L8 Day-1

Accessing the Dataset

The provided data has been demultiplexed with doublets removed resulting in 19,663 cells. The cell line labels for each cell is provided in the cell metadata sections of the provided objects and the 2024_hackathon_cell_metadata.tsv file. All data can be found in this directory. Links to individual data structure files so you can use download bash commands if desired:

We recommend downloading these before the hackathon and doing some quick preliminary assessments to make sure you have a good grasp on the data structure.

Oz Single Cell 2024 Github Organisation

We have generated a github organisation specifically for this hackathon so all code for each project can be stored in a central location. You should have received an invitation to join the Oz Single Cell 2024 Github Organisation but if you didn’t receive it please email d.neavin @ garvan.org.au. Create a repo within the organisation to store all code and scripts and get hacking!