HarounH / nlp-col864

0 stars 0 forks source link

[21st feb] discussion Step 1: Clean DSTC for template

    Step 2:
        Multiple tasks

    Baselines:
        1. 2017 paper - upper limit.
        2. Lower limit - simple memnn using cleaned up data.
        3. Seq2Seq models are not as good as memnn, so we don't implement them.

    -- Interesting question: LSTMs vs MemNN.

    Template induction:
        1. understand code.
        2. email dipanjan das for code. 

        Data for templates:
            try subreddit for getting moar templates.

            Multilabel NER - template induction would work and provide a single field.
            Even if we want to get templates where each slot can have only one field, 
                where do we get such data???

    Transactional conversations:
        We dont have any.

    Once we have templates, we have to do 2 things:
        JUGAAD:
            - create a model that picks a template given {u_i}
                = can use lower
            - slot filling code.

        - IDEAL: joint inference? slot filling + template picking at the same time.
            = entity extraction
            = intent extraction
            C&C paradigm...
                = microsoft at interspeech conf. ... entity intent joint modeling. (we're doing the exact opposite.)
                = still read about it. get some ideas.

todo:
    HH:
        0. read template induction... get code.
        1. hpc set up. (tomorrow) - TF + tutorials...
        2. cleanup code onto repo...
        3. look at dstc data.
    DR:
        1. start using the repo.