FieldDB / FieldDB

An offline/online field database which adapts to its user's terminology and I-Language. http://fielddb.github.io
http://lingsync.org

Camera ready paper for computel #1470

Closed cesine closed 10 years ago

cesine commented 10 years ago

Cutting room:

\subsubsection{Development of novel tasks}

Shared tasks in computational linguistics have traditionally focused on English news text, later branching out to other EU languages.

%http://www-nlpir.nist.gov/projects/duc/data.html
%http://www.nist.gov/tac/tracks/index.html

%http://trec.nist.gov/
%http://www.clef-initiative.eu/edition/clef2010/agenda
\begin{itemize}
    \item TIPSTER (DUC 2001-2005)
    \item TREC (DUC 2001-2005)
    \item AQUAINT (DUC 2007)
    \item Question Answering (TAC 2008, CLEF QA4MRE)
    \item Recognizing Textual Entailment (TAC 2008-2011)
    \item Summarization (TAC 2008-2014)
    \item Knowledge Base Population  (TAC 2009-2014)
    \item Knowledge Base Acceleration
    \item Morphological Segmentation (MorphoChallenge 2005-2010)
    \item Machine Translation (DARPA TIDES, OpenMT 2001-present)
    \item Topic Detection and Tracking (TDT)
    \item CIIR
    \item Clinical Decision Support
    \item Contextual Suggestion
    \item Federated Web Search
    \item Microblog
    \item User Session
    \item Temporal Summarization
    \item Blog 2010
    \item Chemical Information Retrieval  2011
    \item Cross-Language Information Retrieval (TREC 2002)
    \item Crowdsourcing (TREC 2013)
    \item Enterprise Search (TREC present)
    \item Entity (TREC 2011)
    \item Filtering new documents (TREC 2002)
    \item Genomics (TREC 2007)
    \item High Accuracy Retrieval from Documents  HARD (TREC 2004-2005)
    \item Interactive  (TREC 2003)
    \item Legal  (TREC 2012)
    \item Medical Records  (TREC 2012)
    \item Million Query  (TREC 2009) %improve the TREC Pooling which selects documents
    \item Novelty (TREC 2004)
    \item Question Answering (TREC 2007)
    \item Relevance Feedback (TREC 2002-2014)
%   \item Robust Retrieval  (TREC 2005)  %now used as a metric for other ad-hoc tasks
    \item SPAM (TREC 2007)
    \item Terabyte (TREC 2006)
    \item Video (TREC 2001-2002, TRECVID 2003-2014) %http://trecvid.nist.gov/
    \item Intellectual Property (CLEF-IP 2010)
    \item Plagiarism Detection (CLEF PAN 2010-2013)
    \item Log file analysis (LogCLEF 2010)
    \item Multiple Language Question Answering (CLEF MLQA 2010)
\end{itemize}

Knowledge base building:
\begin{itemize}
    \item Knowledge Base Population  (TAC 2009-2014)
    \item Knowledge Base Acceleration
    \item Federated Web Search
\end{itemize}

%http://ciir.cs.umass.edu/research/tides.html
%DARPA's two ultimate goals are (1) to build a machine translation system in one week for a new low-density language, and to be able to sort "communications" into relevant or not-relevant for 80% of the cases and (2) to demonstrate a question answering capability against real-time data feeds from both English and non-English sources.
cesine commented 10 years ago

submitted.

Those interested in reading the prepress version can see this file until we merge it into master:

https://github.com/cesine/FieldDB/blob/computel/docs/ComputEL/dunham%2Ccook%2Chorner-2014-LingSync_and_the_Online_Linguistic_Database_New_models_for_the_collection_and_management_of_data_for_language_communities_linguists_and_language_learners_prepress.pdf