mslehre / text-embedding

LLMs and embeddings for Uni GW texts

sK: FIS data import #59

Open MarioStanke opened 1 year ago

MarioStanke commented 1 year ago

Adapt the new data from the Forschungsinformationssystem (FIS, the research information system) to our existing code and data.

MarioStanke commented 1 year ago

Structure of tables:

Each table is described as `table name: column name 1, column name 2, ...`. The columns are tab-separated.

Table inst_abbrev.tbl: long_name, short_name

The long_names are unique in this table. The short_name is an abbreviated form of the institute name, for example Physik instead of Institut fuer Physik.

Table inst_color.tbl: short_name, color

The color is a consecutive integer. The short_name is unique in this table.
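As a rough illustration of how these two tables could be read and checked, here is a minimal Python sketch. It assumes the files have no header row, which the description above does not state. The helper `read_tbl`, the sample rows, and the color values are hypothetical and only serve to make the example runnable; real code would open the `.tbl` files instead of the in-memory strings.

```python
import csv
import io


def read_tbl(handle, columns, unique_key):
    """Read a tab-separated table (assumed to have no header row) into a list of dicts.

    Raises ValueError if a row has the wrong number of columns or if the
    value in the `unique_key` column occurs more than once.
    """
    rows = []
    seen = set()
    for record in csv.reader(handle, delimiter="\t"):
        if not record:  # skip blank lines
            continue
        if len(record) != len(columns):
            raise ValueError(f"expected {len(columns)} columns, got {record!r}")
        row = dict(zip(columns, record))
        if row[unique_key] in seen:
            raise ValueError(f"duplicate {unique_key}: {row[unique_key]!r}")
        seen.add(row[unique_key])
        rows.append(row)
    return rows


# Made-up sample content standing in for inst_abbrev.tbl and inst_color.tbl.
abbrev_text = (
    "Institut fuer Physik\tPhysik\n"
    "Institut fuer Mathematik und Informatik\tMathematik\n"
)
color_text = "Physik\t0\nMathematik\t1\n"

abbrev = read_tbl(io.StringIO(abbrev_text), ["long_name", "short_name"], "long_name")
colors = read_tbl(io.StringIO(color_text), ["short_name", "color"], "short_name")

# Join the two tables on short_name to map each long_name to its color.
color_of = {row["short_name"]: int(row["color"]) for row in colors}
long_to_color = {row["long_name"]: color_of[row["short_name"]] for row in abbrev}
```

With real files, the same function could be called as `read_tbl(open("inst_abbrev.tbl", newline="", encoding="utf-8"), ...)`. The uniqueness checks mirror the constraints stated for each table, so a violated constraint fails loudly at import time rather than silently overwriting a dictionary entry.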