acl-org / acl-anthology

Data and software for building the ACL Anthology.
https://aclanthology.org
Apache License 2.0
438 stars 297 forks source link

Ingestion Request [05-14-2024]: LREC-COLING 2024 #3175

Closed fcbond closed 4 months ago

fcbond commented 7 months ago

General information about this request

Venue Identifier

lrec, coling

Volume Title

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

Venue Name (only if you are submitting a new venue)

No response

Venue Website (only if you are submitting a new venue)

No response

Date of Publication

2024-05-14

Supporting Information

Hi,

I am in charge of this, along with @arademaker. It is going to be massive, over 1,558 papers in the main conference, plus tutorials and 36 workshops (originally 38, one was merged one was cancelled).

  1. There are many workshops, 30 with existing venues, 7 with new ones (some are joint). We attach a spreadsheet with the complete list, and a summary here. Do we need an issue for each new workshop venue?
    • W9 is joint between FinNLP, econlp and KDF (KDF is not in venues)
    • W10 is joint between MWE and UD
  2. We have ISBNs for the proceedings, tutorials and all the workshops (from ELRA) included in the attached spreadsheet.
  3. Coling and LREC both have ISSNs, we would like to use both: 2951-2093, 2522-2686 [not in the ACL anthology metadata although it would be good to have: is there anywhere to note this?]

Please help us with this mammoth undertaking!

Francis @fcbond

New workshop venues:

neusymbridge
delite
determit
dlnld
htres
polp
rfp
safei

All workshop venues (two workshops are joint):

rapid
neusymbridge
delite
determit
dlnld
games
htres
finnlp
mwe
udw
legal
mathnlp
nlperspectives
parlaclarin
polp
politicalnlp
rfp
cawl
sigul
signlang
bucc
isa
eurali
readi
econlp
osact
wildre
ldl
dmr
rail
humeval
ecnlp
unlp
safeai
cogalex
lt4hala
trac
tdle

Proceedings titles for LREC-COLING 2024 final.xlsx

helenemazo commented 7 months ago
  1. Workshops venue: the venue is the same for the conference and all workshops/Tutorials = Lingotto Conference Center in Turin (Italy)
  2. The ISBN for the tutorials should be: 978-2-493814-35-7 (to be confirmed)
  3. The ISSN number goes with the ISBNs. Please use the LREC ISSN: 2522-2686
  4. Let me check this with Khalid and come back to you. Best, Hélène
fcbond commented 7 months ago

venue is the acl-anthology term for the series the proceedings is associated with: https://aclanthology.org/venues/

I am checking with the ISSN authority if we can use both ISSNs.

Also, it looks as though two workshops have merged, I will send a new spreadsheet as soon as I know the details.

fcbond commented 7 months ago

We want to use both ISSNS: 2522-2686, 2951-2093

I have verified that this is ok:

Yours is a particular case, you can identify this year joint proceedings with both ISSN. As it is a one-shot joint conference, a new ISSN assignment is not neccesary.

We havea added a note to the LREC Proceedings record: 2024 conference, entitled "Joint International Conference on Computational Linguistics, Language Resources and Evaluation" held jointly with "International conference on computational linguistics" ISSN 2951-2093, that clarifies the issue.

From the ISSN International Centre Metadata and Technical Coordination of the ISSN Network Department

arademaker commented 7 months ago

What acronym should we use for the LREC-COLING 2024? Both events have their acronyms:

  1. https://aclanthology.org/venues/lrec/
  2. https://aclanthology.org/venues/coling/

Eventually, they will become separated again. Does it make sense to have one new acronym for the joint event? but the links above will both miss the 2024 edition.

mbollmann commented 7 months ago

What acronym should we use for the LREC-COLING 2024? Both events have their acronyms:

Proceedings can have multiple venues assigned to them, so it can appear on both.

fcbond commented 7 months ago

Thank you. When we use ACLpub in softconf, should the abbreviation be the venue name, or is that unconnected? If they are connected, how do we write mulitple cenues? lrec,coling?

arademaker commented 7 months ago

Hi @mbollmann, can you answer the @fcbond question? We need to finish the proceedings of LREC-COLING 2024 as soon as possible.

mbollmann commented 7 months ago

I know what it can/should look like in our XML, but I don’t know anything about ACLpub, unfortunately. Maybe @mjpost knows.

mbollmann commented 7 months ago

You probably shouldn’t see the venue question as a blocker, though, since it’s an easy one-line change in our XML if it turns out wrong (I hope @mjpost or @anthology-assist won’t crucify me for saying this :)). ACL-IJCNLP 2021 is an example for how this will look in the end.

fcbond commented 7 months ago

Thanks @mbollmann,

We will try to get the files to you as quickly as possible, so that you can look at issues there.

There is one question it would be very helpful to answer now. We have 5 new venues. Do we need an issue for each new workshop venue? Or is this single issue enough?

There have been a couple of changes, I will update the original issue to match the current state (two workshops got merged, we have multiple ISSNS, etc.)

Thanks again for your help,

mbollmann commented 7 months ago

What I do realize now (sorry for spamming multiple messages) is that the Anthology IDs will only reflect one of the venues though; as for example ACL-IJCNLP papers have 2021.acl-* IDs. Do you have a strong preference for having both venues in the ID itself?

mbollmann commented 7 months ago

There is one question it would be very helpful to answer now. We have 5 new venues. Do we need an issue for each new workshop venue? Or is this single issue enough?

It will be easiest to compile them all under this one issue.

fcbond commented 7 months ago

Hi,

I think the id is just an ID, we don't have a strong preference (at least I don't).

mjpost commented 7 months ago

Hi, thanks for tagging me, I am just seeing this.

One issue is fine as @mbollmann mentions.

As for venue ID (lrec vs. coling), we can associated the proceedings with both venues, so that it will appear jointly on both pages. But you have to choose one, which will be used in the filename.

anthology-assist commented 6 months ago

Just waiting on the final venue ID and the ingestion material.

fcbond commented 6 months ago

Thank you!

For the venue ID, let's go with lrec.

The ingestion material is very close --- we are waiting on the ELRA foreword and one workshop.

We hope to get it to you within two days.

When it is ready, can we pass you a google drive folder with the .tgz files? Or how would you like it, ...

Yours,

arademaker commented 6 months ago

I remember we can now make a PR with the files, right?

anthology-assist commented 6 months ago

@arademaker @fcbond When ingestion material is ready, you can share the google drive folder link with the .tgz files directly here!

arademaker commented 6 months ago

Hi @anthology-assist

here are the links

  1. all papers here
  2. tutorials here
  3. link for the google drive with the workshops proceedings here

We would like to have the event listed in both

  1. https://aclanthology.org/venues/lrec/
  2. https://aclanthology.org/venues/coling/

For the event itself, we expect something like

https://aclanthology.org/events/lrec-2022/ listing

  1. the proceedings of the papers
  2. the proceedings of the tutorials
  3. the 36 workshops from the Google Driver (see that 03 and 25 were cancelled)
arademaker commented 6 months ago

Can you guys start the ingestion? We have 10 days for the conference: https://lrec-coling-2024.org/. It would be really nice if people can have the files in the ACL Anthology during the conference.

anthology-assist commented 6 months ago

@arademaker See inline response.

Hi @anthology-assist

here are the links

  1. all papers here

This link does NOT work.

  1. tutorials here
  2. link for the google drive with the workshops proceedings here

So far I found W03 is missing and W07 meta file is wrong. Could you fix them?

We would like to have the event listed in both

  1. https://aclanthology.org/venues/lrec/
  2. https://aclanthology.org/venues/coling/

For the event itself, we expect something like

https://aclanthology.org/events/lrec-2022/ listing

  1. the proceedings of the papers
  2. the proceedings of the tutorials
  3. the 36 workshops from the Google Driver (see that 03 and 25 were cancelled)
fcbond commented 6 months ago

Hi,

Hi @anthology-assist https://github.com/anthology-assist

here are the links

  1. all papers here https://softconf.com/lrec-coling2024/papers/pub/aclpub/proceedings.tgz

This link does NOT work.

Sorry, I think softoncf rebuilt it one last time (which takes over an hour). It is there now.

  1. tutorials here https://softconf.com/lrec-coling2024/tutorials/pub/aclpub/proceedings.tgz
  2. link for the google drive with the workshops proceedings here https://drive.google.com/drive/folders/1vBHJGWrgxWyAo5HgvzE6QGnbt1Xrt-ES?usp=sharing

So far I found W03 is missing. Could you double check all workshops are present?

W03 was cancelled. I will update the list.

We would like to have the event listed in both

  1. https://aclanthology.org/venues/lrec/
  2. https://aclanthology.org/venues/coling/

For the event itself, we expect something like

https://aclanthology.org/events/lrec-2022/ listing

  1. the proceedings of the papers
  2. the proceedings of the tutorials
  3. the 36 workshops from the Google Driver (see that 03 and 25 were cancelled)

Yours,

-- Francis Bond https://fcbond.github.io/

fcbond commented 6 months ago

I have updated the information and the attachment in the first comment.

arademaker commented 6 months ago

dear @anthology-assist

Here is the list of the links for the proceedings updated with all meta file revised.

Tutorials and main papers:

https://softconf.com/lrec-coling2024/tutorials/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/papers/pub/aclpub/proceedings.tgz

Workshops. We are still working to have 3 remaining workshop files that will need to be produced offline by their organizers. We will update here as soon as they send us the new version.

https://softconf.com/lrec-coling2024/cawl2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/cl4health2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/cogalex2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/determit2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/dlnld2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/dmr2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/ecnlp-7/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/eurali2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/finnlp-kdf2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/gamesandnlp2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/htres2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/humeval2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/isa2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/ldl2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/legal2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/lt4hala2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/mathnlp2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/mwe-ud2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/neusymbridge2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/nlperspectives2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/osact2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/parlaclarin-iv/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/politicalnlp2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/rail2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/rapid2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/readi2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/reference-framing-perspective2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/safeconvai2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/sigul2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/tdle2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/trac2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/unlp2024/pub/aclpub/proceedings.tgz
https://softconf.com/lrec-coling2024/wildre-7/pub/aclpub/proceedings.tgz
anthology-assist commented 6 months ago

@fcbond The main and tutorials volumes, and most of the workshops have been ingested here #3293. Please take a look at the preview and let us know if anything looks wrong (you can leave comment directly on that PR).

There are several workshops contain wrong meta file information hence they aren't ingested -- W7, W8, W9, W13, W16, W17, W38. Please correct them.

fcbond commented 6 months ago

Thank you!

We had noticed some errors, and have been correcting them all weekend. When they are done we will let you know.

We will leave new comments on the PR.

arademaker commented 6 months ago

@anthology-assist what are the errors? I have just fixed all the URL field and one shortbooktitle field. We are still waiting 04, 20 and 21, the only ones we can't fix ouselves in the softconf system.

arademaker commented 5 months ago

according to https://aclanthology.org/faq/

How does the ACL Anthology use Digital Object Identifiers (DOIs)? ... Note while the ACL Anthology hosts third-party materials coming from sister societies, these materials are hosted courtesy the ACL but are not assigned DOIs by the ACL due to costs and copyright limitations; the DOI information above is only applicable to ACL sponsored events by ACL, its chapters or SIGs.

I guess LREC, COLING and many of its workshops would be eligible for getting DOI, right? What is the proper way to add DOI for all papers in the LREC-COLING 2024 and its workshops?

mjpost commented 5 months ago

DOIs cost $1 to mint. We can do this for you under the Anthology identifier and arrange for you to reimburse ACL. A simpler process would be for you to generate them and then give us a list of (Anthology ID, DOI) pairs, that we could then add to the papers.

We use the following two scripts to generate DOIs that may be helpful to you if you choose route 2:

fcbond commented 5 months ago

Thanks @mjpost, we will check with the general chairs and get back to you as soon as we can.

marcschulder commented 5 months ago

It appears two workshops got ingested but didn't make it onto the event's list of volumes: signlang and BUCC are missing from lrec-2024 and coling-2024

arademaker commented 5 months ago

Dear all,

In the first page of https://aclanthology.org/2024.mwe-1.0.pdf my surname is incorrect "Rademacher" instead of "Rademaker". Can you guys fix? Possible recompiling the PDF?

Error in the title page and page IV.

arademaker commented 5 months ago

The new version of the MWE-UD proceedings is here:

https://drive.google.com/file/d/1IrGrkcdVv8BOgP5nCy--Cginu93NE8eY/view?usp=drive_link

anthology-assist commented 4 months ago

@arademaker If the changes only occur on two pages within 1 pdf, we do not need to perform a reingestion. Could you submit a correction ticket for the name spelling update?