Ing. Zajíc Zbyněk, Ph.D.
Supervising student projects
název | type year |
supervisor specialization |
assigned |
---|---|---|---|
Detekce úseků nahrávky s více řečníky mluvícími přes sebe |
PRJ4 PRJ5 BP DP |
Zajíc Zbyněk
UI |
volné |
Diarizace řečníka |
PRJ4 PRJ5 BP DP |
Zajíc Zbyněk
UI |
volné |
Srovnání různých reprezentací řečníků pro úlohu rozpoznávání mluvčích |
BP |
Zajíc Zbyněk
UI |
volné |
Zobrazení výsledků rozpoznání řečníků v nahrávce |
SPC PRJ4 PRJ5 |
Zajíc Zbyněk
UI |
volné |
Hierarchické metody shlukování v akustickém prostoru. |
BP 2014/2015 |
Zajíc Zbyněk
UI |
dokončeno (Tomáš Jindáček) |
Testování neuronových sítí |
PRJ4 2014/2015 |
Zajíc Zbyněk
UI |
dokončeno (Jan Šmejkal) |
Publications
+ / - Publications in year 2021
Applying EEND Diarization to Telephone Recordings from a Call Center .
SPECOM,
Lecture Notes in Computer Science,
vol. 12997,
p. 807-817,
Springer, Cham,
Karpov A., Potapova R.,
2021.
:
+ / - Publications in year 2020
Diarization Based on Identification with X-Vectors. .
Speech and Computer, 22nd International Conference, SPECOM 2019, St. Petersburg, Russia, October 7-9,2020, Proceedings.,
p. 667-678,
2020.
:
An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents .
SPECOM: International Conference on Speech and Computer,
Lecture Notes in Computer Science ,
vol. 12335,
p. 166-175,
Springer, Cham,
2020.
:
Speech and web-based technology to enhance education for pupils with visual impairment .
Journal on Multimodal User Interfaces,
Springer Nature Switzerland AG,
2020.
:
+ / - Publications in year 2019
UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge .
Interspeech,
p. 993-997,
2019.
:
Detection of Overlapping Speech for the Purposes of Speaker Diarization .
Speech and Computer (SPECOM 2019),
p. 247-257,
Springer, Cham,
2019.
:
Diarization of The Language Consulting Center Telephone Calls .
Speech and Computer (SPECOM 2019),
p. 549-558,
Springer, Cham,
2019.
:
+ / - Publications in year 2018
ZCU-NTIS Speaker Diarization System for the DIHARD 2018 Challenge .
Interspeech,
p. 2788-2792,
2018.
:
First Insight into the Processing of the Language Consulting Center Data .
Speech and Computer 20th International Conference (SPECOM 2018),
p. 778-787,
Cham: Springer Nature Switzerland AG,
2018.
:
Recurrent Neural Network Based Speaker Change Detection from Text Transcription Applied in Telephone Speaker Diarization System .
Text, Speech, and Dialogue 21st International Conference, TSD 2018,
p. 342-350,
Cham: Springer Nature Switzerland AG,
2018.
:
Towards Processing of the Oral History Interviews and Related Printed Documents .
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018),
2104,
European Language Resources Association (ELRA),
2018.
:
+ / - Publications in year 2017
Experiments with Segmentation in an Online Speaker Diarization System .
Text, Speech and Dialogue, Proceedings of the 20th International Conference TSD 2017,
Lecture Notes in Computer Science,
vol. 10415,
p. 429-437,
Springer,
2017.
:
Convolutional Neural Network for speaker change detection in telephone speaker diarization system .
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
p. 4945-4949,
IEEE,
2017.
:
Neural Network Speaker Descriptor in Speaker Diarization of Telephone Speech .
Speech and Computer 19th International Conference (SPECOM 2017),
p. 555-563,
Springer,
2017.
:
Speaker Diarization Using Convolutional Neural Network for Statistics Accumulation Refinement .
Interspeech, 18th Annual Conference of the International Speech Communication Association,
p. 3562-3566,
2017.
:
First Insight into the Processing of the Historical Documents from the Period of Totalitarian Regimes .
Data a znalosti 2017,
p. 89-92,
Plzeň: ZČU v Plzni,
2017.
:
+ / - Publications in year 2016
Fisher Vectors in PLDA Speaker Verification System .
The IEEE 13th International Conference on Signal Processing, ICSP 2016,
p. 1338-1341,
IEEE Press,
2016.
:
Investigation of Segmentation in i-Vector Based Speaker Diarization of Telephone Speech .
Speech and Computer 18th International Conference, SPECOM 2016,
p. 411-418,
Springer,
2016.
:
+ / - Publications in year 2014
Comparison of Score Normalization Methods Applied to Multi-label Classification .
IEEE International Symposium on Signal Processing and Information Technology,
Institute of Electrical and Electronics Engineers ( IEEE ),
Noida, India,
2014.
:
Convolutional Neural Network for Refinement of Speaker Adaptation Transformation .
16th International Conference on Speech and Computer, SPECOM 2014,
Lecture Notes in Artificial Intelligence,
vol. 8773,
p. 161-168,
2014.
:
Score Normalization Methods Applied to Topic Identification .
Text, Speech, and Dialogue, 17th International Conference, TSD 2014,
Lecture Notes in Artificial Intelligence,
vol. 8655,
p. 133-140,
Springer,
2014.
:
+ / - Publications in year 2013
A Direct Criterion Minimization based fMLLR via Gradient Descend .
Text, Speech, and Dialogue,
Lecture Notes in Computer Science,
vol. 8082,
p. 52-59,
Springer,
2013.
:
+ / - Publications in year 2012
Factor Analysis and Nuisance Attribute Projection Revisited .
Interspeech 2012,
p. 1570-1573,
Curran Associates, Inc.,
2012.
:
Analysis of the Influence of Speech Corpora in the PLDA Verification in the Task of Speaker Recognition .
Lecture Notes in Computer Science,
p. 464-471,
2012.
:
Bottleneck ANN: dealing with small amount of data in shift-MLLR adaptation .
IEEE 11th International Conference on Signal Processing,
p. 507-510,
Beijing,
2012.
:
Initialization of Adaptation by Sufficient Statistics Using Phonetic Tree .
IEEE 11th International Conference on Signal Processing,
p. 503-506,
Beijing,
2012.
:
On Complementarity of State-of-the-art Speaker Recognition Systems .
IEEE International Symposium on Signal Processing and Information Technology,
p. 164-169,
IEEE,
2012.
:
Robust Adaptation Techniques Dealing with Small Amount of Data .
Text, Speech and Dialogue (TSD 2012),
p. 480-487,
Springer, Berlin,
2012.
:
+ / - Publications in year 2011
Fast Estimation of Gaussian Mixture Model Parameters on GPU using CUDA .
The 12th International Conference on Parallel and Distributed Computing, Applications and Technologies,
p. 167-172,
IEEE Computer Society Conference Publishing Services (CPS),
2011.
:
Initialization of fMLLR with Sufficient Statistics from Similar Speakers .
Lecture Notes in Computer Science,
vol. 6836/2011,
p. 187-194,
Springer-Verlag Berlin Heidelberg,
2011.
:
+ / - Publications in year 2010
Discriminative adaptation based on fast combination of DMAP and DfMLLR .
Interspeech 2010,
p. 534-537,
ISCA,
2010.
:
Robust Statistic Estimates for Adaptation in the Task of Speech Recognition .
Lecture Notes in Computer Science,
vol. 6231/2010,
p. 464-471,
Springer-Verlag Berlin Heidelberg,
2010.
:
SMOOTHING FACTOR IN DISCRIMINATIVE FEATURE ADAPTATION .
Studentská Vědecká Konference,
p. 57-58,
2010.
:
+ / - Publications in year 2009
Methods of Unsupervised Adaptation in Online Speech Recognition .
SPECOM'2009 Proceedings,
p. 448-453,
2009.
:
Fast Speaker Adaptation in Automatic Online Subtitling .
SIGMAP,
p. 126-130,
Italy,
2009.
:
Refinement Approach for Adaptation Based on Combination of MAP and fMLLR .
Lecture Notes in Computer Science,
vol. Volume 5729/2009,
p. 274-281,
2009.
:
+ / - Publications in year 2008
An Expert System in Speaker Verification Task .
Proceedings of Interspeech 2008 incorporating SST 2008,
vol. 9,
p. 355-358,
International Speech Communication Association,
Brisbane, AU,
2008.
:
Automatická adaptace akustického modelu .
ZČU,
2008.
:
+ / - Publications in year 2007
The Speaker Adaptation of an Acoustic Model .
The 1st Young Researchers Conference on Applied Sciences,
p. 212-217,
Západočeská univerzita,
Plzeň,
2007.
:
A Cohort Methods for Score Normalization in Speaker Verification System, Acceleration of On-line Cohort Methods .
Specom 2007 Proceedings,
p. 367-372,
Moskow State Linguistic University,
Moskow,
2007.
: