jhclark / ducttape

A workflow management system for researchers who heart Unix.
http://jhclark.github.com/ducttape
Other
119 stars 14 forks source link

Discussion: Removing "baseline" branches from names #62

Closed jhclark closed 12 years ago

jhclark commented 12 years ago

Consider this task from the cdec_kftt.tape example:

task align_dir : cdec
                < src=(AlignDir: s2t=$f_train@download t2s=$e_train@download)
                < tgt=(AlignDir: s2t=$e_train@download t2s=$f_train@download)
                > align=align.directional.gz {                                                                                                  
  paste $src $tgt | sed 's/\t/ ||| /g' > corpus                                                                                                      
  $cdec/training/model1 -A -d -v corpus | gzip > align.directional.gz
}

While there are 2 realizations here, which will be used by symmetrization via grafting, the directory names will be a bit unintuitive:

This is because the first branch of each branch point is considered the baseline. And baseline branches are omitted from directory names for 2 reasons:

However, this isn't terribly intuitive. This should be well-documented.

Perhaps there's another way of making this clearer to the user as well. Thoughts?

dowobeha commented 12 years ago

@jhclark, does the resolution of #96 also resolve this issue?

jhclark commented 12 years ago

Yes it does.