Pipeline
Pipeline helper functions.
parsimony_only(data_set, full_prioritization=False)
Run the XLRanker pipeline with only the parsimonious selection step.
This will likely leave many protein pairs marked PARSIMONY_AMBIGUOUS. To avoid ambiguity, set full_prioritization to True, which selects one pair at random as the representative for each ambiguous group.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
data_set | XLDataSet | Cross-linking dataset that needs prioritization | required |
full_prioritization | bool | If True, randomly select a representative pair for each ambiguous group. | False |
Returns:
Name | Type | Description |
---|---|---|
XLDataSet | XLDataSet | XLDataSet with only parsimonious selection performed. |
Source code in src/xlranker/pipeline.py
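The effect of full_prioritization described above can be illustrated with a toy sketch. This is not XLRanker's implementation: `resolve_groups`, the `Status` enum, and the list-of-tuples input are hypothetical stand-ins, and only the PARSIMONY_AMBIGUOUS label comes from the documentation. It assumes each group holds candidate protein pairs that the parsimony step could not separate.

```python
import random
from enum import Enum


class Status(Enum):
    # Hypothetical labels; only PARSIMONY_AMBIGUOUS is named in the docs.
    PARSIMONY_SELECTED = "PARSIMONY_SELECTED"
    PARSIMONY_AMBIGUOUS = "PARSIMONY_AMBIGUOUS"
    NOT_SELECTED = "NOT_SELECTED"


def resolve_groups(groups, full_prioritization=False, rng=None):
    """Assign a status to every candidate pair in every group."""
    rng = rng or random.Random()
    status = {}
    for candidates in groups:
        if len(candidates) == 1:
            # A single candidate is unambiguous.
            status[candidates[0]] = Status.PARSIMONY_SELECTED
        elif full_prioritization:
            # Pick one representative at random; the rest are dropped.
            chosen = rng.choice(candidates)
            for pair in candidates:
                status[pair] = (
                    Status.PARSIMONY_SELECTED
                    if pair == chosen
                    else Status.NOT_SELECTED
                )
        else:
            # Without full prioritization, the whole group stays ambiguous.
            for pair in candidates:
                status[pair] = Status.PARSIMONY_AMBIGUOUS
    return status


groups = [
    [("P1", "P2")],                # unambiguous group
    [("P3", "P4"), ("P3", "P5")],  # ambiguous group with two candidates
]
default_status = resolve_groups(groups)
full_status = resolve_groups(groups, full_prioritization=True, rng=random.Random(0))
```

Because the representative is chosen at random, repeated runs may pick different pairs unless the random generator is seeded, as the hypothetical `rng` argument does here.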
run_full_pipeline(data_set, threshold=0.5)
Run the full XLRanker pipeline.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
data_set | XLDataSet | Cross-linking dataset that needs prioritization | required |
threshold | float | Score threshold for the expanded report | 0.5 |
Returns:
Name | Type | Description |
---|---|---|
XLDataSet | XLDataSet | XLDataSet with full prioritization |
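One plausible reading of the threshold parameter is a score cutoff applied when the expanded report is built. The sketch below shows only that idea. `expanded_report` and the score dictionary are hypothetical, and the inclusive comparison (`>=`) is an assumption, since the documentation does not say how a score equal to the threshold is handled.

```python
def expanded_report(scores, threshold=0.5):
    """Keep pairs whose score meets the threshold, highest score first.

    Hypothetical helper: the inclusive cutoff is an assumption.
    """
    kept = [(pair, score) for pair, score in scores.items() if score >= threshold]
    return sorted(kept, key=lambda item: item[1], reverse=True)


# Made-up scores for illustration only.
scores = {
    ("P1", "P2"): 0.91,
    ("P3", "P4"): 0.50,
    ("P3", "P5"): 0.12,
}
report = expanded_report(scores)
```

Raising the threshold shortens the report, while lowering it admits lower-scoring pairs.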