Open creisle opened 1 year ago
These all look good to me
Another weird case I am not sure what to do with
<sec><title>Title of a thing</title><p>paragraph content</p></sec>
becomes
<passage>Title of a thing</passage><passage>paragraph content</passage>
which makes less sense for bioc, but when it gets concatenated together tho as
Title of a thingparagraph content
We need whitespace between the two. Should we be adding a trailing single space or new line to the first passage when we parse the XML?
Another weird special case on the superscripts to add to the tests
Compared with <italic>KRAS</italic> wild type and empty vector controls, <italic>KRAS</italic> <sup>10</sup>G<sup>11</sup> and <sup>11</sup>GA<sup>12</sup> significantly enhanced in vivo tumor growth
should be
Compared with KRAS wild type and empty vector controls, KRAS 10G11 and 11GA12 significantly enhanced in vivo tumor growth
Input XML | Proposed Output | Current Output |
---|---|---|
The 2-year invasive disease-free survival rate was 93·9% |
The 2-year invasive disease-free survival rate was 93.9% | The 2-year invasive disease-free survival rate was 93*9% |
Since I've been going through these in such detail I've noticed a few cases where the output doesn't look like what I would expect but I want to clear them with you @jakelever before I make the appropriate changes. I've listed them in a table below
incubator containing 5% CO<sub>2</sub>
10<sup>4</sup>
especially in <italic>CBL</italic>-W802* cells
influenced by the presence of allelic variants—GSTP1 Ile<sub>105</sub>Val (rs1695) and <italic>GSTP1</italic> Ala<sub>114</sub>Val (rs1138272), with homozygote
breast cancer, clear cell renal carcinoma, and colon cancer<xref ref-type="bibr" rid="b6">6</xref><xref ref-type="bibr" rid="b7">7</xref> <xref ref-type="bibr" rid="b8">8</xref> <xref ref-type="bibr" rid="b9">9</xref> <xref ref-type="bibr" rid="b10">10</xref> have successfully identified
, and in the transgenic\nGATA-1,\n<sup>low</sup> mouse
we selected an allele (designated <italic>cic</italic><sup><italic>4</italic></sup>) that removes
regulation of the Wnt-β-catenin pathway
the specific HPV<sup>+</sup> gene expression
known to be resistant to 1<sup>st</sup> and 2<sup>nd</sup> generation EGFR-TKIS, osimertinib
at 37°C in a humidified 5% CO<sub>2</sub> incubator
seeded at concentrations below 1 × 10<sup>6</sup>/ml, selected
9 patients with a <italic>BRAF</italic>-mutant tumour
patients with <italic>BRAF</italic><sup>WT</sup> tumours
MSI<sup>hi</sup> tumours
upper limit of normal, creatinine clearance ⩾30 ml min<sup>−1</sup>,
the oncometabolite R(–)-2-hydroxyglutarate at the
[<sup>3</sup>H]-Thymidine