speaker-recognition Search Results

1000+ results
for speaker-recognition

Best match


Azure-Samples/cognitive-services-speech-sdk #2101

Speech recognition quality got worse?

In the last 1-2 weeks, many of my users have reported a decrease in speech detection quality, and I am struggling to understand what the cause could be. I noticed an increase in "SpeechNotRecognized" even…

Trunksome updated 4 months ago
20
eclipse-archived/smarthome #1093

Semantic tagging

ESH has a basic tagging implementation, which allows adding tags (as simple strings) to items. The idea behind this is to give items semantics. So while the "category" refers to a taxonomy (e.g. th…

kaikreuzer updated 6 years ago
95
errata-ai/Microsoft #4

Don’t spell out the term if the acronym is listed in The Ame…

jdkato updated 4 months ago
2
QwenLM/Qwen-Audio #32

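How should the prompt be written to get the output for a single task, or for a specific desired task?

I want to do emotion recognition. With prompt='{audio_url}', the result is: assets/audio/1.wav Mandarin, female voice, 31 years old, "今天天气真好" ("The weather is really nice today"). As shown, the result includes the language, gender, age, and transcript, but no emotion. How should I write the prompt to get the output for a single task, or for the specific task I want?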

wjyfelicity updated 7 months ago
2
tc39/proposal-intl-segmenter #133

Custom Dictionaries

ICU's BreakIterator has clear limitations in its approach for character-based languages without textual word boundaries. When used directly, it allows you to specify a dictionary to work around limita…

nathanhammond updated 3 years ago
32
pyannote/pyannote-audio #1668

AttributeError: 'Annotation' object has no attribute 'for_js…

Tested versions: Python 3.12.2, Pyannote.audio 3.1.1, Pyannote.core 5.0.0. ### Sys…

alvynabranches updated 6 months ago
3
prettier/prettier #4801

formatWithCursor performance bottleneck

I've isolated a bottleneck from our production environment and here's a nifty self-contained benchmark for it: https://gist.github.com/tmcw/1a4e8ee47941454337dc5952dbf90180 (swap require('./') for req…

tmcw updated 9 months ago
15
microsoft/cognitive-services-speech-sdk-js #818

[Bug]: JS SpeechSDK.AudioConfig.fromDefaultMicrophoneInput c…

### What happened? Hi Team, I'm using the JS SDK to capture speech with SpeechSDK.AudioConfig.fromDefaultMicrophoneInput. If a Teams/Zoom call is in progress in the desktop app, Teams/Zoom ca…

ru4sam326 updated 4 months ago
8
octimot/StoryToolkitAI #147

Advanced Voice Recognition and Tagging System for Multi-Spea…

This enhancement will be particularly beneficial for transcribing meetings, interviews, gaming sessions, and podcasts involving multiple speakers, enabling users to distinguish who is speaking at an…

chris-hughes1 updated 8 months ago
6
SEACrowd/seacrowd-datahub #366

Create dataset loader for Kheng.info Speech

Dataloader name: `kheng_info_speech/kheng_info_speech.py` DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?kheng_info_speech | Dataset| kheng_info_speech | |-------------|---…

SamuelCahyawijaya updated 7 months ago
1
