dariusk / NaNoGenMo-2015

National Novel Generation Month, 2015 edition.
341 stars 21 forks source link

An Anthology of Fake Speeches #46

Open BrianHicks opened 8 years ago

BrianHicks commented 8 years ago

I haven't done a whole lot of generative text, but I've got a couple ideas:

That said, I'm not too sure about the domain here, so if any of those sound easier or harder for someone with little computational linguistics experience to tackle, I'd love the feedback.

ikarth commented 8 years ago

Those are all good ideas. I'd guess that how easy they are depends on which aspect of text generation you want to dive into. I'd rate them as going from hardest to easiest, though that's partially colored by my biases:

Either way, looking forward to seeing what you come up with.

cpressey commented 8 years ago

I like the speeches idea, probably because speeches are heavy on rhetoric. It would be interesting to see how much (if any) of that "oratory voice" remains after combining multiple speeches (even ones on vastly different topics) in some way.

enkiv2 commented 8 years ago

There have been prior examples of trying to generate speeches that have been fairly successful. I specifically recall someone using char-rnn with all of Obama's speeches, and again with TED talks; the TED talks ended up being less coherent in terms of topic but otherwise very stylisitically similar to normal TED talks, and comparable in the amount of content to the low end of TED talks: https://medium.com/@samim/ted-rnn-machine-generated-ted-talks-3dd682b894c0

This indicates that, with a larger corpus (say, the set of all state of the union addresses -- which are in the public domain & available from several sources), you could probably produce significantly more complex & interesting speeches.

On Fri, Oct 30, 2015 at 7:53 AM Chris Pressey notifications@github.com wrote:

I like the speeches idea, probably because speeches are heavy on rhetoric. It would be interesting to how much (if any) of that "oratory voice" remains after combining multiple speeches (even ones on vastly different topics) in some way.

— Reply to this email directly or view it on GitHub https://github.com/dariusk/NaNoGenMo-2015/issues/46#issuecomment-152505133 .

ikarth commented 8 years ago

Not to mention you can throw in, say, Mark Twain's speeches if you're looking for a less serious result.

There is, on the other hand, a certain purity to the idea of the Ur-State-of-the-Union speech.

BrianHicks commented 8 years ago

Thanks all, I'm going to give the fake speeches idea a shot! I'll update the title of my issue.