Omic Data
The machine learning portion of the pipeline requires measurements that is used to prioritize ambiguous pairs. The input to the pipeline is a path to a folder containing data matrices. Traditionally, RNAseq and proteome data is provided.
Data Format
The file should be a tab-separated file, with the first column being the protein/gene in Gene Symbol. The following columns are measurements in each sample. The value used for each gene is the mean measurement across all the included samples.
Accepted values for missing values are: NA
and blank.