StevenLOL / LIUM

Scripts for LIUM SpkDiarization tools
MIT License

ILP clustering - JDK 1.8 #1

Open libeppi opened 10 years ago

libeppi commented 10 years ago

Hello, student here. Your script is great! It really helped me better understand speech segmentation. I'm able to run it up to ILP clustering (the last step). When I try to go further, this is what I get:

11:42.567 TrainIVectorOrTINFO | Compute statistics {make() / 1}
11:42.731 TrainIVectorOrTINFO | Train i-vector {make() / 1}
11:42.732 TotalVariabilitINFO | sent #: 0 {estimateL() / 1}
Exception in thread "main" com.thoughtworks.xstream.converters.ConversionException: -1 : -1
---- Debugging information ----
message : -1
cause-exception : java.lang.ArrayIndexOutOfBoundsException
cause-message : -1
class : fr.lium.spkDiarization.libModel.gaussian.FullGaussian
required-type : fr.lium.spkDiarization.libModel.gaussian.FullGaussian
converter-type : com.thoughtworks.xstream.converters.reflection.ReflectionConverter
path : /fr.lium.spkDiarization.libModel.ivector.EigenFactorRadialList/globalMeanCovariance/gender
class[1] : fr.lium.spkDiarization.libModel.ivector.EigenFactorRadialList
version : 8.4.1 09/12/2013 10:48 AM

I didn't install glpk via apt-get because, as of today (Ubuntu 14.04), it is not working. I tried installing it manually, the old-fashioned way, but I don't know if everything went OK.

Any tips? Thank you, Giuseppe

StevenLOL commented 10 years ago

It seems you haven't reached the ILP step yet.

1) For ILP, you can try

sudo apt-get install glpk-utils

To check whether it is installed:

$ which glpsol

An error related to ILP would look like this:

Exception in thread "main" java.io.IOException: Cannot run program "glpsol": error=2, No such file or directory...

2) Did you download the UBM and the models?

http://www-lium.univ-lemans.fr/diarization/lib/exe/fetch.php/data_ilp.tgz

3) I tested all of the above on Ubuntu 14.04 just now and found no problems.
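The checks in points 1) and 2) can be bundled into a small pre-flight script. This is only a sketch: the function names are made up for illustration, and the file list is an assumption taken from the directory tree posted later in this thread, not an official list of what the ILP step reads.

```shell
#!/bin/sh
# Illustrative pre-flight checks; function names are hypothetical.

# Point 1: the ILP step shells out to glpsol, so it must be on PATH.
check_glpsol() {
    if command -v glpsol >/dev/null 2>&1; then
        echo "glpsol found at: $(command -v glpsol)"
    else
        echo "glpsol not found on PATH (try: sudo apt-get install glpk-utils)"
        return 1
    fi
}

# Point 2: the downloaded data files should exist and be non-empty.
# Assumed file list, copied from the tree output shown in this thread.
check_ilp_files() {
    base_dir=${1:-.}
    status=0
    for f in mat/wld.efn.xml mat/wld.mahanalobis.mat mat/wld.tv.mat ubm/wld.gmm; do
        if [ ! -s "$base_dir/$f" ]; then
            echo "missing or empty: $f"
            status=1
        fi
    done
    return "$status"
}
```

Running `check_glpsol` and then `check_ilp_files .` from the directory that contains go.sh should print one line per problem and return non-zero if anything is missing.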

libeppi commented 10 years ago

Thank you for your reply! I checked everything you suggested, and running "tree" gives me the same structure you posted on the homepage. I'm probably still missing something. I'll let you know.

Thank you.

.
├── go.sh
├── go.sh~
├── ilp_diarization2.sh
├── ilp_diarization2.sh~
├── LICENSE
├── LIUM_SpkDiarization-8.4.1.jar
├── mat
│   ├── wld.efn.xml
│   ├── wld.mahanalobis.mat
│   └── wld.tv.mat
├── models
│   ├── gender.gmms
│   ├── s.gmms
│   ├── sms.gmms
│   └── ubm.gmm
├── README.md
├── t001.out.seg
├── test_out
│   └── t001
│       ├── t001.d.3.seg
│       ├── t001.flt.3.seg
│       ├── t001.g.3.seg
│       ├── t001.gmms
│       ├── t001.h.3.seg
│       ├── t001.init.gmms
│       ├── t001.i.seg
│       ├── t001.l.seg
│       ├── t001.mfcc
│       ├── t001.pms.seg
│       └── t001.s.seg
├── test_wav
│   └── t001.wav
└── ubm
    └── wld.gmm

StevenLOL commented 10 years ago

You'd better check your "wld.efn.xml". Did you download it from

http://www-lium.univ-lemans.fr/diarization/lib/exe/fetch.php/data_ilp.tgz ?

libeppi commented 10 years ago

Still not working. I just double-checked and re-downloaded all the files needed. :(

StevenLOL commented 10 years ago

Is your wld.gmm in the ubm folder?

libeppi commented 10 years ago

Yes, I re-checked. Nothing yet.

libeppi commented 10 years ago

Steven, can you please paste here what happens after:

02:56.304 MatrixIO INFO | read ./mat/wld.tv.mat: row : 3072 col:50 {readRectMatrix() / 1}
02:56.450 TrainIVectorOrTINFO | Compute statistics {make() / 1}
02:56.628 TrainIVectorOrTINFO | Train i-vector {make() / 1}
02:56.629 TotalVariabilitINFO | sent #: 0 {estimateL() / 1}

I think I'm getting closer to the issue. It is probably not able to open mat/wld.efn.xml. Thank you.

libeppi commented 10 years ago

Problem solved: JDK 1.8 was the cause. I downgraded to 1.7 and it is now working.
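For anyone hitting the same exception, a quick way to see which Java the script will pick up is sketched below. The function names are made up for illustration, and the only facts it encodes are the ones from this thread: the bundled 8.4.1 jar failed under 1.8 and worked under 1.7. Other versions are simply flagged as not reported here.

```shell
#!/bin/sh
# Illustrative helpers; names are hypothetical.

# Read the version string from `java -version`, which prints a line
# such as: java version "1.8.0_05"
installed_java_version() {
    java -version 2>&1 | awk -F'"' '/version/ { print $2; exit }'
}

# Map a version string to its major number: 1.7.0_80 -> 7, 11.0.2 -> 11.
java_major() {
    case "$1" in
        1.*) v=${1#1.}; echo "${v%%.*}" ;;
        *)   echo "${1%%[.-]*}" ;;
    esac
}

# Succeeds only for Java 7, the version reported working in this thread.
check_java_for_lium() {
    ver=${1:-$(installed_java_version)}
    major=$(java_major "$ver")
    if [ "$major" = "7" ]; then
        echo "Java $ver: reported working in this thread"
    else
        echo "Java ${ver:-not found}: only 1.7 was reported working in this thread; 1.8 failed"
        return 1
    fi
}
```

Calling `check_java_for_lium` with no argument inspects the `java` on PATH; passing a version string such as `1.8.0_05` checks that string instead.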

StevenLOL commented 10 years ago

Sorry, the "glpsol1" in ilp_diarization2.sh should be glpsol. I saw your comment on my mobile phone, but I can't find it on GitHub.

As for this issue, maybe it is caused by Java? Or by the system encoding? I have this application running on 64-bit Ubuntu 10.04, 12.04, 12.10, and 14.04, using the OpenJDK 7 JRE or Oracle Java 7, and have never encountered this problem. I would like to know your platform. Did you test it on a virtual machine or a Live CD?

StevenLOL commented 10 years ago

Oh, problem solved. Thanks, libeppi.

libeppi commented 10 years ago

:+1:

zXpp commented 8 years ago

Hi, I agree with him that your script is great, but I ran into a problem when running the program. I want to get the output files "t001.ilp.problem.120.txt" and "t001.ilp.solition.120.txt", but when I run ./go.sh these two files are not produced. The first image shows your output files and the second shows my tree structure; you can see that these two files are missing. I have already installed glpk and it works. The last image is my result output. Please help me solve this problem! Thank you.

zXpp commented 8 years ago

OK, my problem was also solved by using JDK 1.7!

zXpp commented 8 years ago

This is the content of the file t001.ev_is.120.seg. Can you tell me what the figures and parameters mean? Thank you!

libeppi commented 8 years ago

Hi zXpp,

For example:

9981217_0700_0800_inter_fm_dga 1 1 317 F S U spk0

1) 9981217_0700_0800_inter_fm_dga : show name
2) 1 : channel number
3) 1 : segment start
4) 317 : length of segment
5) F : speaker gender (U unknown, F female, M male)
6) S : bandwidth type (T telephone, S studio)
7) U : environment type (music, speech, etc.)
8) spk0 : speaker label
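The eight fields above can be pulled apart with awk. The sketch below makes two assumptions that are not stated in this thread: that start and length are counted in 10 ms feature frames (100 per second), and that lines beginning with ";;" are comment lines to skip. The function name is made up for illustration.

```shell
#!/bin/sh
# Illustrative parser for one or more .seg lines read from stdin.
describe_seg() {
    awk '
        /^;;/ { next }      # assumption: ";;" marks a comment line
        NF >= 8 {
            # assumption: start and length are in 10 ms frames
            printf "show=%s channel=%s start=%.2fs duration=%.2fs ", $1, $2, $3 / 100, $4 / 100
            printf "gender=%s band=%s env=%s speaker=%s\n", $5, $6, $7, $8
        }
    '
}
```

For the example line, `echo "9981217_0700_0800_inter_fm_dga 1 1 317 F S U spk0" | describe_seg` prints the show name, channel 1, a start of 0.01s, a duration of 3.17s, and the F, S, U, spk0 labels on one line. A whole file can be piped in the same way, for instance `describe_seg < t001.ev_is.120.seg`.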

zXpp commented 8 years ago

Thank you! I got it, but I have some further questions and need your help. The picture shows the resulting cluster set for another wave file.

1) Have you noticed that the environment type of almost every cluster is U, even though the audio was obviously music, speech, or laughter? Why was everything detected as unknown?

2) This is the cluster set result. Do you know how to run the next step and get the final DER result, like in the following chart? The LIUM website doesn't mention it, and I am not familiar with Java. Can you help me? Thank you!

StevenLOL commented 8 years ago

Hi @zxpp, if you have the evaluation data, then I think it should come with documents on the system output format and how to get the DER.

You may find useful tools at NIST: http://www.nist.gov/itl/iad/mig/tools.cfm
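For reference, the diarization error rate (DER) is commonly defined, as in the NIST Rich Transcription evaluations, as the fraction of reference speaker time that is scored incorrectly. The symbols below are notation chosen here for illustration, not output names from LIUM or the NIST tools:

```latex
\mathrm{DER} = \frac{T_{\text{miss}} + T_{\text{fa}} + T_{\text{conf}}}{T_{\text{ref}}}
```

Here T_miss is missed speech time, T_fa is false-alarm speech time, T_conf is time attributed to the wrong speaker, and T_ref is the total reference speaker time.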

ahmetaa commented 7 years ago

So why is it failing in Java 8?

ahmetaa commented 7 years ago

Just FYI, I made a copy of the LIUM Speaker Diarization code (under GPL) that works with Java 8.

https://github.com/ahmetaa/lium-diarization

The problem was the XStream classes. I updated them and created a build script that can generate a new jar-with-dependencies file. You can open the project in IDEA and run the "stand-alone-jar" target.