Some Questions about vocab?

Dear friends,

I checked some vocab.txt under uncased_L-12_H-768_A-12, chinese_L-12_H-768_A-12, multilingual_L-12_H-768_A-12 and find some questions:

1. Many ##xxx in vocab.txt, for an example:

  $ cat uncased_L-12_H-768_A-12/vocab.txt | grep "##" | sort |  more
##at
##ata
...
   $ cat uncased_L-12_H-768_A-12/vocab.txt | grep -w "at"
at
##at

My question is: we have "at" in vocab.txt, why needs "##at" ? what does "##at" mean here ?

2. Many numbers are there in vocab.txt, for an example:

  $ cat uncased_L-12_H-768_A-12/vocab.txt | grep 0
    1609
690
1910s
840
1086
...

My question is: digit numbers are unlimited, is it reasonable putting them into vocab.txt?

3. Some common word missing in vocab, for an example: Word "fax" does not exists in uncased_L-12_H-768_A-12/vocab.txt,

  $ cat uncased_L-12_H-768_A-12/vocab.txt | grep fax
halifax
fairfax

Thanks your answer in advance.

google-research / bert