Search by EDAM concept ID - ES tokenizes on slash by default 🙏
I knew ES was tokenizing on blank space by default but not that it also tokenizes on slash. I wanted to allow users to search with only the last part of the class ID (e.g. "operation_3207") so it's nice that this feature is already implemented.
An example of challenge with a non-null operation value
http://localhost:9200/openchallenges-challenge-000001/_search?q=(name:%22Drug%20Sensitivity%20and%20Drug%20Synergy%20Prediction%22)
{
"took": 33,
"timed_out": false,
"_shards": {
"total": 1,
"successful": 1,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 1,
"relation": "eq"
},
"max_score": 20.742569,
"hits": [
{
"_index": "openchallenges-challenge-000001",
"_type": "_doc",
"_id": "4",
"_score": 20.742569,
"_source": {
"contributions": [
{
"organization_id": 1,
"role": "sponsor"
},
{
"organization_id": 52,
"role": "data_contributor"
},
{
"organization_id": 131,
"role": "sponsor"
},
{
"organization_id": 150,
"role": "data_contributor"
}
],
"created_at": "2023-11-01T22:08:36.000000000Z",
"description": "Development of new cancer therapeutics currently requires a long and protracted process of experimentation and testing. Human cancer cell lines represent a good model to help identify associations between molecular subtypes, pathways, and drug response. In recent years there have been several efforts to generate genomic profiles of collections of cell lines and to determine their response to panels of candidate therapeutic compounds. These data provide the basis for the development of in silico models of sensitivity based either on the unperturbed genetic potential of a cancer cell, or by using perturbation data to incorporate knowledge of actual cell response. Making predictions from either of these data profiles will be beneficial in identifying single and combinatorial chemotherapeutic response in patients. To that end, the present challenge seeks computational methods, derived from the molecular profiling of cell lines both in a static state and in response to perturbation of ...",
"doi": "",
"end_date": "2012-10-01",
"headline": "Predicting drug sensitivity in human cell lines",
"input_data_types": {
"name": "metabolomic",
"slug": "metabolomic"
},
"name": "Drug Sensitivity and Drug Synergy Prediction",
"operation": {
"class_id": "http://edamontology.org/operation_3207",
"preferred_label": "Gene methylation analysis"
},
"platform": {
"name": "Synapse",
"slug": "synapse"
},
"starred_count": 0,
"start_date": "2012-06-01",
"status": "completed",
"submission_types": {
"name": "prediction_file"
},
"_entity_type": "ChallengeEntity"
}
}
]
}
}
Closes #2550
Changelog
Preview
Search by EDAM concept ID
http://localhost:8000/challenge?searchTerms=http://edamontology.org/operation_3207
Search by EDAM concept ID - ES tokenizes on slash by default 🙏
I knew ES was tokenizing on blank space by default but not that it also tokenizes on slash. I wanted to allow users to search with only the last part of the class ID (e.g. "operation_3207") so it's nice that this feature is already implemented.
http://localhost:8000/challenge?searchTerms=operation_3207
BTW, this is how we can visualize how ES tokenize the property
operation.class_id
:Search by EDAM concept preferred name
http://localhost:8000/challenge?searchTerms=Gene%20methylation%20analysis
An example of challenge with a non-null operation value
Indexed EDAM concept in ES