Vlad777 / mit-stanford

MIT-Stanford MOOC mashup
2 stars 0 forks source link

Parsing of professor name #6

Open Vlad777 opened 11 years ago

Vlad777 commented 11 years ago

There are variations of the names of the professor that we need to extract just the name in order to use with ratemyprofessor or otherwise ratemyprofessor search doesn't work.

For ratemyprofessor, we'll also need the school associated. For example Prof. Christopher Cummins exists in 3 different universities.

Here are the variations I found so far from MIT and the following rules that would need to be implemented to clean them up:

  1. "Dr. " prefix has to be removed.
  2. "Prof. " prefix has to be removed.
  3. " Faculty Liaison" suffix that has to be removed.
  4. "Biological Engineering Faculty" is not a valid name.
  5. " (*)" suffix, of anything in parentheses at the end, has to be removed, for example: (Principal Instructor)
charslong commented 11 years ago

ok, prefix removal should be easy. thanks for bringing this to my attention.

ghost commented 11 years ago

I removed these prefixes. It seems like MITpages lags quite a lot though.