Ing. Pražák Aleš, Ph.D.
Activities
Large vocabulary continuous speech recognition (LVCSR)
Courses
Course Lecturer
- Introduction to Shadow Speaker Practice (KKY/SRU)
- Shadow Speaker Training (KKY/SRP)
- Speech Analysis and Recognition (KKY/ARŘ)
Practical Lesson Lecturer
- Introduction to Shadow Speaker Practice (KKY/SRU)
- Shadow Speaker Training (KKY/SRP)
- Speech Analysis and Recognition (KKY/ARŘ)
Publications
Publications in year 2018
UWebASR – Web-based ASR engine for Czech and Slovak .
CLARIN Annual Conference 2018 Proceedings,
p. 190-193,
2018.
:
First Insight into the Processing of the Language Consulting Center Data .
Speech and Computer 20th International Conference (SPECOM 2018),
p. 778-787,
Cham: Springer Nature Switzerland AG,
2018.
:
Towards Processing of the Oral History Interviews and Related Printed Documents .
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018),
2104,
European Language Resources Association (ELRA),
2018.
:
Publications in year 2014
Captioning of Live TV Commentaries from the Olympic Games in Sochi: Some Interesting Insights .
Lecture Notes in Artificial Intelligence,
vol. 8655,
p. 515-522,
Springer,
2014.
:
General framework for mining, processing and storing large amounts of electronic texts for language modeling purposes .
Language Resources and Evaluation,
p. 227-248,
2014.
:
Publications in year 2013
Online Speaker Adaptation of an Acoustic Model using Face Recognition .
Text, Speech and Dialogue, Proceedings of the 16th International Conference TSD 2013,
Lecture Notes in Artificial Intelligence,
vol. 8082,
p. 378-385,
Springer Berlin Heidelberg,
2013.
:
Publications in year 2012
Neural Network Language Model with Cache .
Lecture Notes in Computer Science,
p. 528-534,
2012.
:
Publications in year 2011
Automatic Topic Identification for Large Scale Language Modeling Data Filtering .
Text, Speech and Dialogue,
Lecture Notes in Computer Science,
vol. 6836,
p. 64-71,
Springer,
Heidelberg,
2011.
:
Four-Phase Re-speaker Training System .
Proceedings of the International Conference on Signal Processing and Multimedia Applications,
p. 217-220,
2011.
:
Publications in year 2010
Online TV captioning of Czech Parliamentary Sessions .
Text, Speech and Dialogue,
Lecture Notes in Computer Science,
vol. 6231,
p. 416-422,
Springer Berlin / Heidelberg,
2010.
:
Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions .
Lecture Notes in Computer Science,
vol. 2010,
p. 385-391,
Springer,
Heidelberg,
2010.
:
Publications in year 2009
Discriminative training of gender-dependent acoustic models .
Text, Speech and Dialogue,
p. 331-338,
Springer,
Plzeň,
2009.
:
Training of Speaker-Clustered Acoustic Models for Use in Real-Time Recognizers .
Proceedings of the International Conference on Signal Processing and Multimedia Applications,
p. 131-135,
INSTICC,
Milan,
2009.
:
Methods of Unsupervised Adaptation in Online Speech Recognition .
SPECOM'2009 Proceedings,
p. 448-453,
2009.
:
Fast Speaker Adaptation in Automatic Online Subtitling .
SIGMAP,
p. 126-130,
Italy,
2009.
:
Publications in year 2008
Automatic Speech Recognition and Information Retrieval Techniques for Facilitating Access to Video Archives of Cultural Heritage .
IEEE SMC International Conference on Distributed Human-Machine Systems,
p. 323-328,
Czech Technical University,
Athens,
2008.
:
Efficient Combination of N-gram Language Models and Recognition Grammars in Real-Time LVCSR Decoder .
9th International Conference on Signal Processing Proceedings,
p. 587-591,
IEEE,
Beijing, China,
2008.
:
Multiple Application of the MLLT Based on Clustering Supported by Phonetic Knowledge .
9th International Conference on Signal Processing Proceedings,
p. 613-617,
IEEE,
Beijing, China,
2008.
:
Publications in year 2007
Keyword Spotting in LVCSR Based Word Lattices for Large Multimedia Search .
SPECOM 2007 Proceedings,
p. 393-398,
Moscow State Linguistic University,
Moscow,
2007.
:
Language Model Adaptation Using Different Class-Based Models .
SPECOM 2007 Proceedings,
p. 449-454,
Moscow State Linguistic University,
Moscow,
2007.
:
LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling .
SIGMAP 2007,
p. 139-142,
INSTICC PRESS,
Lisbon,
2007.
:
Searching for a robust MFCC-based parameterization for ASR application .
SIGMAP 2007,
p. 196-199,
INSTICC PRESS,
Lisbon,
2007.
:
System for automatic searching of key segments in a large audiovisual archive of ice-hockey matches .
Department of Cybernetics, University of West Bohemia in Pilsen,
2007.
:
Training simulator for shadow speakers .
Department of Cybernetics, University of West Bohemia in Pilsen,
2007.
:
Publications in year 2006
Adaptive language model in automatic online subtitling .
Proceedings of the second IASTED international conference on Computational intelligence,
p. 479-483,
ACTA Press,
Anaheim,
2006.
:
Automatic online subtitling of the Czech parliament meetings .
Lecture Notes in Artificial Intelligence,
ISSN 0302-9743,
vol. 4188,
p. 501-508,
Springer,
Berlin,
2006.
:
Benefit of a class-based language model for real-time closed-captioning of TV ice-hockey commentaries .
Proceedings of LREC 2006,
p. 2064-2067,
ELRA,
Paris,
2006.
:
Publications in year 2005
LVCSR system for automatic online subtitling .
SPECOM 2005 proceedings,
p. 325-328,
Moscow State Linguistic University,
Moscow,
2005.
:
Publications in year 2004
Real-time decoder for LVCSR system .
The 8th world multi-conference on systemics, cybernetics and informatics : vol. VI : image, acoustic, signal processing and optical systems, technologies and applications,
p. 450-454,
International Institute of Informatics and Systemics,
Orlando, Florida,
2004.
:
Publications in year 2003
Voice assimilation phenomenon and its implementation in LVCSR system with lexical tree and bigram language model .
WSEAS TRANSACTIONS on COMPUTERS,
p. 762-765,
WSEAS,
Greece,
2003.
:
Design of LVCSR decoder for Czech language .
ECMS 2003,
p. 39-43,
Technical University,
Liberec,
2003.
: