allenai / scholarphi

An interactive PDF reader.
Apache License 2.0
416 stars 52 forks source link

v2.0 system for definition extractor #136

Closed dykang closed 3 years ago

dykang commented 4 years ago

Develop v2.0 system for doc-level user interfaces and doc-level evaluation

(See dykang/definition_extractor_v2.0)

dykang commented 4 years ago
dykang commented 4 years ago

From @andrewhead: The potential gold protologisms in 2002.04138

dykang commented 4 years ago

A couple of feedback about nickname extractor from @andrewhead

dykang commented 4 years ago

Manually collected set of definitions in 1909.13433 by @andrewhead .

Term Sentence ID Body
deep amortized clustering 44304 a neural architecture which learns to cluster datasets efficiently using a few forward passes
DAC 44304 deep amortized clustering
DAC 43915 Deep Amortized Clustering
DAC 44305 implicitly learns what makes a cluster, how to group data points into clusters, and how to count the number of clusters in datasets
MoG 43909 Mixture of Gaussians
Neural Processes 43914 meta-learns from multiple datasets to learn a prior over functions
ST 43782 Set Transformer
Set Transformer 43782 a permutation-invariant set-input neural network that uses self-attention operations as building blocks
MAB 43784 Multihead Attention Block
PMA 43793 Pooling by Multihead Attention
ACT 44365 Adaptive Computation Time
filtering 44368 a forward pass through $f$ that takes a set $X$ and outputs a parameter $\theta$ to describe a cluster along with a membership probability vector $\mask \in [0, 1]^{n_{\ssc{X}}}$ where $n_X$ is the number of elements in $X$
MAF 44418 Masked Autogregressive Flow
VAE 44419 Variational Autoencoder
Learning to cluster 43821 the task of learning a clustering algorithm from data
NCP 43831 Neural Clustering Process
Neural Clustering Process 43831 an amortized clustering method
VBDPM 44161 Variational Bayesian Dirichlet Process Mixture Model
LL 44337 log-likelihood
ARI 44337 adjusted Rand index
NMI 44337 normalized mutual information
NS 44211 Neural Statistician
"DEC 44228 Deep Embedded Clustering
Omniglot dataset 44243 handwritten characters from 50 different alphabets
NMI 44255 normalized mutual information
DAC 44026 an approach to amortized clustering using set-input neural networks
dykang commented 4 years ago

Target papers for general user study

Target papers for comparison study

dykang commented 3 years ago

Closing this issue as we approach merging #146 to master. The remaining features mentioned here will be merged to v3 system or deprecated due to the lack of usefulness.