Parsing LIUM Speaker Diarization output

Time: 2017-07-25 16:50:26

Tags: java voice-recognition speech

How can I find out how long each speaker talked, using the LIUM Speaker Diarization toolkit?

For example, this is my .seg file.

;; cluster S0 [ score:FS = -33.93166562542459 ] [ score:FT = -34.24966646974656 ] [ score:MS = -34.05223781565528 ] [ score:MT = -34.32834794609819 ]
Seq06 1 0 237 F S U S0
Seq06 1 2960 278 F S U S0
;; cluster S1 [ score:FS = -33.33289449700619 ] [ score:FT = -33.64489165914674 ] [ score:MS = -32.71833169822944 ] [ score:MT = -33.380835069917275 ]
Seq06 1 238 594 M S U S1
Seq06 1 1327 415 M S U S1
Seq06 1 2311 649 M S U S1
;; cluster S2 [ score:FS = -33.354874450638064 ] [ score:FT = -33.46618707052516 ] [ score:MS = -32.70702429201772 ] [ score:MT = -33.042146088874844 ]
Seq06 1 832 495 M S U S2
Seq06 1 1742 569 M S U S2

How do I extract the times from this file?

1 Answer:

Answer 0: (score: 3)

In this line

Seq06 1 2960 278 F S U S0

you have

field 1: Seq06 = the show name
field 2: 1 = the channel number
field 3: 2960 = the start of the segment (in features)
field 4: 278 = the length of the segment (in features)
field 5: F = the speaker gender (U=unknown, F=female, M=male)
field 6: S = the type of band (T=telephone, S=studio)
field 7: U = the type of environment (music, speech only, …)
field 8: S0 = the speaker label

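The times are expressed in features, so 2960 is 29.60 seconds (divide by 100 to convert features to seconds). The length is in features too, so your segment is 2.78 seconds long.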

This is documented in the LIUM wiki.
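
To get the total speaking time per speaker, you can sum field 4 for each speaker label (field 8) and divide by 100. Below is a minimal sketch in Java, assuming the 100 features per second stated above and whitespace-separated fields; the class name SegDurations and the method secondsPerSpeaker are made up for this example and are not part of the LIUM toolkit.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper, not part of LIUM: totals the speaking time per speaker.
class SegDurations {
    // Assumption taken from the answer: 100 features per second.
    static final double FEATURES_PER_SECOND = 100.0;

    /** Sums the segment lengths (field 4) per speaker label (field 8), in seconds. */
    static Map<String, Double> secondsPerSpeaker(List<String> lines) {
        Map<String, Long> features = new TreeMap<>();
        for (String raw : lines) {
            String line = raw.trim();
            // Skip blank lines and ";;" cluster comment lines.
            if (line.isEmpty() || line.startsWith(";;")) {
                continue;
            }
            String[] fields = line.split("\\s+");
            // A segment line has exactly 8 fields; anything else is ignored,
            // for example the tail of a comment line that got wrapped.
            if (fields.length != 8) {
                continue;
            }
            long length = Long.parseLong(fields[3]); // field 4: length in features
            String speaker = fields[7];              // field 8: speaker label
            features.merge(speaker, length, Long::sum);
        }
        Map<String, Double> seconds = new TreeMap<>();
        for (Map.Entry<String, Long> e : features.entrySet()) {
            seconds.put(e.getKey(), e.getValue() / FEATURES_PER_SECOND);
        }
        return seconds;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java SegDurations <file.seg>");
            return;
        }
        List<String> lines =
                Files.readAllLines(Paths.get(args[0]), StandardCharsets.UTF_8);
        secondsPerSpeaker(lines).forEach((speaker, s) ->
                System.out.printf("%s: %.2f s%n", speaker, s));
    }
}
```

For the .seg file in the question this adds up to 5.15 seconds for S0 (237 + 278 features), 16.58 seconds for S1 (594 + 415 + 649) and 10.64 seconds for S2 (495 + 569). The sums are kept as whole features and converted once at the end, which avoids accumulating floating-point error.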