helenginn / cppxfel

cppxfel source - I finished my doctorate. This program is FOSSILISED! Assume no more support!!
3 stars 2 forks source link

Regression in data quality #38

Open biochem-fan opened 8 years ago

biochem-fan commented 8 years ago

It seems that there are at least two problems. Sigma values are for Hg-SAD datasets (11,000 images).

Regression between Apr 26 and May 7

04701bf (Apr 26) is OK (15.66 then 25.17 sigma in cycle 0 and 1). b7143d4 (Apr 28) is OK (15.70 then 25.30) 2b2f4f5 (May 7) is not (13.46 then 21.31 sigma).

Bug in initial merging, introduced between May 7 and Jul 16

At commit 27027bb (Jul 16), the completeness was very low (~ 40%) because most images were rejected as having low CC. ANODE peak heights were < 10 sigma. In cycle 1, 90% of images were rejected. Usually, I get < 40 % rejected at cycle 1 and rejected images are recovered in later cycles as the internal reference improves.

With MC integrated intensities from CrystFEL as the INITIAL_MTZ, I got 22.26 sigma at cycle 0 with completeless 98%. Sigma values hardly improved in later cycles.

27027bb using reference from b7143d4 (Apr 28): 18.00 then 21.53 sigma

biochem-fan commented 8 years ago

Pr-SAD dataset, 10K lattices cppxfel 1b7e98c

hmeduyvesteyn commented 8 years ago

Interesting results! Just a quick request...if you have time, could you possibly try INITIAL_RLP_SIZE 0.00025 and INITIAL_RLP_SIZE 0.00035, both with a STEP_SIZE_RLP_SIZE 0.00004. Just curious to see if there is any pattern.

biochem-fan commented 8 years ago

Done.