NVIDIA / NeMo-text-processing

NeMo text processing for ASR and TTS
https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/nlp/text_normalization/wfst/wfst_text_normalization.html
Apache License 2.0
258 stars 84 forks source link

Northern Sámi TN #11

Closed jimregan closed 11 months ago

jimregan commented 1 year ago

What does this PR do ?

Text Normalisation for Northern Sámi, for Språkbanken Tal

Collection: [Note which collection this PR will affect]

Changelog

Usage

# Add a code snippet demonstrating how to use this 

Before your PR is "Ready for review"

Pre checks:

PR Type:

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed. Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

github-actions[bot] commented 1 year ago

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions[bot] commented 1 year ago

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions[bot] commented 1 year ago

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions[bot] commented 1 year ago

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

jimregan commented 1 year ago

Reviewer 1 of a failed paper attempt wanted to point out this: https://github.com/giellalt/lang-sme/tree/main/src/transcriptions

It's a bit of a stretch to describe it as a text normalisation system -- pieces towards a demo might be more accurate -- and good luck getting it to build following the instructions in the README.

I used this Dockerfile, but, limited as it is, it wasn't worth the trouble:

FROM ubuntu:23.04

WORKDIR /tmp

RUN apt update

RUN apt-get install -y cg3 hfst wget git autoconf subversion gawk icu-devtools make

RUN wget https://apertium.projectjj.com/apt/install-nightly.sh -O - | bash

RUN git clone https://github.com/giellalt/giella-core && \
    cd giella-core && \
    ./autogen.sh && \
    ./configure && \
    make

WORKDIR /tmp

ENV GIELLA_CORE=/tmp/giella-core

RUN git clone https://github.com/giellalt/lang-sme && \
    cd lang-sme && \
    ./autogen.sh && \
    ./configure && \
    make
github-actions[bot] commented 1 year ago

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions[bot] commented 1 year ago

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions[bot] commented 12 months ago

This PR is stale because it has been open for 14 days with no activity. Remove stale label or comment or update or this will be closed in 7 days.

github-actions[bot] commented 11 months ago

This PR was closed because it has been inactive for 7 days since being marked as stale.