Contact
name: Josef Psutka
phone: +420 37763 2513, 2507, 2100, 2113
office: UN555
e-mail: psutka@kky.zcu.cz
Prof. Ing. Josef Psutka, CSc.
Activities
- Head of Department
- Coordinator - Center of Computational Linguistics
Courses
Course Guarantor
- Bachelor State Final Exam IS (KKY/BZIB)
- Artificial Intelligence (KKY/UI)
- Artificial Intelligence (KKY/UISZ)
- Bachelor State Final Exam CC (KKY/BZPŘ)
- Bachelor State Final Exam IC (KKY/BZIK)
- Basics of Cybernetics for Informatics (KKY/ZKYI)
- Decision Systems (KKY/ROSZ)
- Defence of Bachelor Thesis (KKY/OKŘTB)
- Defence of Bachelor Thesis CC (KKY/OBPŘ)
- Defence of Bachelor Thesis IC (KKY/OBIK)
- Defence of Bachelor Thesis IS (KKY/OBIB)
- Defence of Master (Ing.) Thesis (KKY/OKŘT)
- Defence of Master (Ing.) Thesis (KKY/OŘRS)
- Fundamentals of Cybernetics (KKY/ZKY)
- Introduction to Artificial Intelligence (KKY/UUI)
- Pattern Recognition (KKY/USK)
- Pattern Recognition and Machine Learning (KKY/SUR)
- Speech Analysis and Recognition (KKY/ARŘ)
- Systems of Perception and Understanding (KKY/SVP)
- Thesis Tutorial (KKY/DP)
- Thesis Tutorial CC (KKY/BPPŘ)
- Thesis Tutorial IC (KKY/BPIK)
- Thesis Tutorial IS (KKY/BPIB)
Course Lecturer
- Applied Cybernetics (KKY/AKSZ)
- Artificial Intelligence (KKY/UI)
- Basics of Cybernetics for Informatics (KKY/ZKYI)
- Fundamentals of Cybernetics (KKY/ZKY)
- Introduction to Artificial Intelligence (KKY/UUI)
- Pattern Recognition and Machine Learning (KKY/SUR)
- Speech Analysis and Recognition (KKY/ARŘ)
- Systems of Perception and Understanding (KKY/SVP)
Practical Lesson Lecturer
- Artificial Intelligence (KKY/UI)
- Pattern Recognition and Machine Learning (KKY/SUR)
- Systems of Perception and Understanding (KKY/SVP)
Publications
Publications in year 2014
Captioning of Live TV Commentaries from the Olympic Games in Sochi: Some Interesting Insights .
Lecture Notes in Artificial Intelligence,
vol. 8655,
p. 515-522,
Springer,
2014.
:
Audio-Video Speaker Diarization for Unsupervised Speaker and Face Model Creation .
Text, Speech and Dialogue, Proceedings of the 17th International Conference TSD 2014,
Lecture Notes in Artificial Intelligence,
2014.
:
Anti-Models: An Alternative Way to Discriminative Training .
Text, Speech and Dialogue - TSD 2014,
p. 449-456,
Springer,
2014.
:
Sports Video Classification in Continuous TV Broadcasts .
The 12th IEEE International Conference on Signal Processing (ICSP'14),
HangZhou China,
2014.
:
Publications in year 2013
Estimation of Single-Gaussian and Gaussian Mixture Models for Pattern Recognition .
18th Iberoamerican Congress on Pattern Recognition,
Lecture Notes in Computer Science,
Springer,
2013.
:
Covariance Matrix Enhancement Approach to Train Robust Gaussian Mixture Models of Speech Data .
Speech and Computer,
Lecture Notes in Computer Science,
vol. 8113,
p. 92-99,
Springer,
2013.
:
Online Speaker Adaptation of an Acoustic Model using Face Recognition .
Text, Speech and Dialogue, Proceedings of the 16th International Conference TSD 2013,
Lecture Notes in Artificial Intelligence,
vol. 8082,
p. 378-385,
Springer Berlin Heidelberg,
2013.
:
Publications in year 2012
Full Covariance Gaussian Mixture Models Evaluation on GPU .
IEEE International Symposium on Signal Processing and Information Technology,
Vietnam, Ho Chi Minh City,
2012.
:
Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors .
IEEE Transactions on Audio, Speech, and Language Processing,
6,
vol. 20,
p. 1818-1828,
Institute of Electrical and Electronics Engineers (IEEE),
2012.
:
Slovak Unit-Selection Speech Synthesis: Creating a New Slovak Voice within a Czech TTS System ARTIC .
IAENG International Journal of Computer Science,
vol. 39,
p. 147-154,
2012.
:
Publications in year 2011
Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings .
Text, Speech, and Dialogue,
Lecture Notes in Computer Science,
vol. 6836,
p. 284-290,
Springer,
2011.
:
FOUR-PHASE RE-SPEAKER TRAINING SYSTEM .
Proceedings of the International Conference on Signal Processing and Multimedia Applications,
p. 217-220,
2011.
:
Czech Senior COMPANION: Wizard of Oz Data Collection and Expressive Speech Corpus Recording and Annotation .
Human Language Technology. Challenges for Computer Science and Linguistics,
Lecture Notes in Computer Science,
vol. 6562,
p. 280-290,
Springer Berlin / Heidelberg,
Vetulani, Zygmunt,
2011.
:
New Slovak Unit-Selection Speech Synthesis in ARTIC TTS System .
Proceedings of the World Congress on Engineering and Computer Science 2011,
p. 485-490,
San Francisco, USA,
2011.
:
Publications in year 2010
Online TV captioning of Czech Parliamentary Sessions .
Text, Speech and Dialogue,
Lecture Notes in Computer Science,
vol. 6231,
p. 416-422,
Springer Berlin / Heidelberg,
2010.
:
Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions .
Lecture Notes in Computer Science,
vol. 2010,
p. 385-391,
Springer,
Heidelberg,
2010.
:
Publications in year 2009
Czech Senior COMPANION: Wizard of Oz Data Collection and Expressive Speech Corpus Recording .
Human Language Technologies as a Challenge for Computer Science and Linguistics,
p. 266-269,
Wydawnictwo Poznańskie Sp. z o.o.,
Poznan, Poland,
2009.
:
Discriminative training of gender-dependent acoustic models .
Text, Speech and Dialogue,
p. 331-338,
Springer,
Plzeň,
2009.
:
Czech Broadcast Conversation Speech .
vol. LDC2009S02,
Linguistic Data Consortium,
Philadelphia, USA,
2009.
:
Training of Speaker-Clustered Acoustic Models for Use in Real-Time Recognizers .
Proceedings of the International Conference on Signal Processing and Multimedia Applications,
p. 131-135,
INSTICC,
Milan,
2009.
:
Using Morphological Information for Robust Language Modeling in Czech ASR System .
IEEE Transactions on Audio Speech and Language Processing,
vol. 17,
p. 840-847,
Institute of Electrical and Electronics Engineers (IEEE),
2009.
:
Publications in year 2008
Automatic Speech Recognition and Information Retrieval Techniques for Facilitating Access to Video Archives of Cultural Heritage .
IEEE SMC International Conference on Distributed Human-Machine Systems,
p. 323-328,
Czech Technical University,
Athens,
2008.
:
Voice-supported electronic health record in dentistry .
Actas de Congreso INFOLAC2008,
p. 1-3,
Asociación Argentina de Informática Médica,
Buenos Aires,
2008.
:
Bidirectional Voice Interaction with Dental Electronic Health Record .
Med-e-Tel 2008,
1,
vol. 2008,
p. 289-293,
2008.
:
Voice-controlled Data Entry in Dental Electronic Health Record .
Studies in Health Technology and Informatics,
vol. 2008,
p. 529-534,
IOS Press,
Goteborg,
2008.
:
What Can and Cannot Be Found in Czech Spontaneous Speech Using Document-Oriented IR Methods? UWB at CLEF 2007 CL-SR Track .
Lecture Notes in Computer Science,
vol. 5152,
p. 712-718,
2008.
:
Publications in year 2007
Feature space reduction and decorrelation in a large number of speech recognition experiments .
Signal and Image Processing,
p. 158-161,
ACTA Press,
Anaheim,
2007.
:
LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling .
SIGMAP 2007,
p. 139-142,
INSTICC PRESS,
Lisbon,
2007.
:
Hlasový dialog s počítačem [Voice Dialogue with a Computer] .
Umělá inteligence 5 [Artificial Intelligence 5],
p. 284-327,
Academia,
Prague,
2007.
:
An Intelligent Telephony Interface of Multiagent Decision Support Systems .
IEEE Transactions on Systems, Man, and Cybernetics,
4,
vol. 37,
p. 553-560,
2007.
:
Hungarian MALACH acoustic front-end .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen,
2007.
:
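Systém automatického vyhledávání klíčových segmentů v rozsáhlém audiovizuálním archivu hokejových zápasů [System for Automatic Retrieval of Key Segments in a Large Audiovisual Archive of Ice-Hockey Matches] .
Department of Cybernetics, University of West Bohemia in Pilsen,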
2007.
:
Trenažér pro trénování stínových řečníků [Simulator for Training Shadow Speakers] .
Department of Cybernetics, University of West Bohemia in Pilsen,
2007.
:
Publications in year 2006
Adaptive language model in automatic online subtitling .
Proceedings of the second IASTED international conference on Computational intelligence,
p. 479-483,
ACTA Press,
Anaheim,
2006.
:
Automatic transcription of audio archives for spoken document retrieval .
Proceedings of the second IASTED international conference on Computational intelligence,
p. 448-452,
ACTA Press,
Anaheim,
2006.
:
Comparison of keyword spotting methods for searching in speech .
Interspeech 2006,
p. 1894-1897,
ISCA,
Bonn,
2006.
:
Automatic online subtitling of the Czech parliament meetings .
Lecture Notes in Artificial Intelligence (ISSN 0302-9743),
vol. 4188,
p. 501-508,
Springer,
Berlin,
2006.
:
Recognition of spontaneous speech - some problems and their solutions .
CITSA 2006,
p. 169-172,
IIIS,
Orlando,
2006.
:
Benefit of a class-based language model for real-time closed-captioning of TV ice-hockey commentaries .
Proceedings of LREC 2006,
p. 2064-2067,
ELRA,
Paris,
2006.
:
Exploiting linguistic knowledge in language modeling of Czech spontaneous speech .
Proceedings of LREC 2006,
p. 2600-2603,
ELRA,
Paris,
2006.
:
Fast keyword spotting from acoustic baseforms .
Proceedings of the 11th international conference "Speech and computer" SPECOM'2006,
p. 79-99,
Anatolya Publisher,
St. Petersburg,
2006.
:
Dialogový systém pro přihlašování studentů na zkoušky [Dialogue System for Registering Students for Exams] .
Department of Cybernetics, University of West Bohemia in Pilsen,
2006.
:
Modul zpracování klíčových slov CZ [Keyword Processing Module CZ] .
Department of Cybernetics, University of West Bohemia in Pilsen,
2006.
:
Polish Malach Speech Corpus .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundation,
2006.
:
Slovak Malach Speech Corpus .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundation,
2006.
:
Slovak Spontaneous Speech – Acoustic & Language Models (MALACH) .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundation,
2006.
:
Mluvíme s počítačem česky [Speaking Czech with a Computer] .
p. 752,
Academia,
Prague,
2006.
:
Publications in year 2005
Multi-agent decision support systems with remote multimedia access .
IECON 2005,
p. 2204-2209,
IEEE,
Raleigh,
2005.
:
Automatic transcription of Czech, Russian and Slovak spontaneous speech in the MALACH project .
Interspeech Lisboa 2005,
p. 1349-1352,
ISCA,
Bonn,
2005.
:
Czech spontaneous speech corpus with structural metadata .
Interspeech Lisboa 2005,
p. 1165-1168,
ISCA,
Bonn,
2005.
:
Automatic transcription of Czech, Russian and Slovak spontaneous speech in the MALACH project .
Eurospeech,
vol. 1,
p. 1349-1352,
ISCA,
Bonn,
2005.
:
Man-machine communication by voice .
Interdisciplinary aspects of human-machine co-existence and co-operation,
p. 214-223,
Czech Technical University,
Prague,
2005.
:
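Celouniverzitní telefonní dialogový systém informující o výsledcích přijímacího řízení na ZČU využívající ASR a TTS [University-Wide Telephone Dialogue System Providing Information on Admission Results at the University of West Bohemia, Using ASR and TTS] .
Department of Cybernetics, University of West Bohemia in Pilsen,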
2005.
:
Czech Spontaneous Speech – Acoustic & Language Models (MALACH) .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundation,
2005.
:
Russian Malach Speech Corpus .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundation,
2005.
:
Russian Spontaneous Speech – Acoustic & Language Models (MALACH) .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundati,
2005.
:
Shoah - System for Spontaneous Speech Annotation .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundation,
2005.
:
Publications in year 2004
Automatic punctuation annotation in Czech broadcast news speech .
SPECOM'2004,
p. 319-325,
SPIIRAS,
Saint-Petersburg,
2004.
:
False alarms reduction in keyword spotting system .
The 8th world multi-conference on systemics, cybernetics and informatics : vol. VI : image, acoustic, signal processing and optical systems, technologies and applications,
p. 460-464,
International Institute of Informatics and Systemics,
Orlando, Florida,
2004.
:
Issues in annotation of the Czech spontaneous speech corpus in the MALACH project .
Fourth international conference on language resources and evaluation,
p. 607-610,
European Language Resources Association,
Lisbon,
2004.
:
The development of ASR for Slavic languages in the MALACH project .
International conference on acoustics, speech, and signal processing,
p. 749-752,
IEEE,
Piscataway,
2004.
:
The development of ASR for Slavic languages in the MALACH project .
Acoustics, Speech, and Signal Processing,
p. 749-752,
IEEE,
Piscataway,
2004.
:
Czech broadcast news speech .
p. 4,
Linguistic Data Consortium (LDC),
USA,
2004.
:
Czech broadcast news transcripts .
p. 4,
Linguistic Data Consortium (LDC),
USA,
2004.
:
Automatic recognition of spontaneous speech for access to multilingual oral history archives .
IEEE transactions on speech and audio processing,
vol. 4,
p. 420-435,
2004.
:
Czech Broadcast News Corpus .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen (distribution rights transferred to Linguistic Data Consortium, University of Pe,
2004.
:
Publications in year 2003
Improving a keyword spotting system using phoneme sequence .
WSEAS TRANSACTIONS on COMPUTERS,
p. 751-755,
WSEAS,
Greece,
2003.
:
Voice assimilation phenomenon and its implementation in LVCSR system with lexical tree and bigram language model .
WSEAS TRANSACTIONS on COMPUTERS,
p. 762-765,
WSEAS,
Greece,
2003.
:
Experiments with automatic segmentation for Czech speech synthesis .
Text, Speech and Dialogue,
Lecture Notes in Computer Science,
vol. 2807,
p. 287-294,
Springer,
Berlin, Heidelberg,
2003.
:
Building LVCSR system for transcription of spontaneously pronounced Russian testimonies in the MALACH project: initial steps and first results .
Lecture Notes in Artificial Intelligence,
vol. 2807,
p. 327-332,
Springer,
Berlin,
2003.
:
Building LVCSR system for transcription of spontaneously pronounced Russian testimonies in the MALACH project: initial steps and first results .
Lecture Notes in Computer Science,
Lecture Notes in Artificial Intelligence,
vol. 2807,
p. 327-332,
Springer,
Berlin,
2003.
:
Towards automatic transcription of spontaneous Czech speech in the MALACH project .
Lecture Notes in Artificial Intelligence,
vol. 2807,
p. 214-219,
Springer,
Berlin,
2003.
:
Automatic segmentation for Czech concatenative speech synthesis using statistical approach with boundary-specific correction .
Eurospeech 2003 - Interspeech, proceedings of the 8th European Conference on Speech Communication and Technology,
p. 301-304,
ISCA,
Geneva, Switzerland,
2003.
:
Fitting class-based language models into weighted finite-state transducer framework .
EUROSPEECH 2003 PROCEEDINGS,
p. 1873-1876,
ISCA,
Geneva,
2003.
:
Large vocabulary ASR for spontaneous Czech in the MALACH project .
EUROSPEECH 2003 PROCEEDINGS,
p. 1821-1824,
ISCA,
Geneva,
2003.
:
The Czech speech and prosody database both for ASR and TTS purposes .
EUROSPEECH 2003 PROCEEDINGS,
p. 1577-1580,
ISCA,
Geneva,
2003.
:
Fitting class-based language models into weighted finite-state transducer framework .
Eurospeech,
vol. 1,
p. 1873-1876,
ISCA,
Geneva,
2003.
:
Large vocabulary ASR for spontaneous Czech in the MALACH project .
Eurospeech,
vol. 1,
p. 1821-1824,
ISCA,
Geneva,
2003.
:
Automatic transcription of TV ice-hockey commentary .
Proceedings,
p. 419-423,
International Institute of Informatics and Systemics,
Orlando,
2003.
:
Design of LVCSR decoder for Czech language .
ECMS 2003,
p. 39-43,
Technical University,
Liberec,
2003.
:
Recognition of spontaneously pronounced TV ice-hockey commentary .
ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition,
p. 83-86,
Tokyo Institute of Technology,
Tokyo,
2003.
:
Czech Malach Speech Corpus .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen, Johns Hopkins University in Baltimore, Shoah Visual History Foundati,
2003.
:
Publications in year 2002
German and Czech Speech Synthesis Using HMM-Based Speech Segment Database .
Text, Speech and Dialogue, proceedings of the 5th International Conference TSD 2002,
Lecture Notes in Artificial Intelligence,
vol. 2448,
p. 173-180,
Springer,
Berlin, Heidelberg,
2002.
:
Automatic transcription of Czech language oral history in the MALACH project: resources and initial experiments .
Lecture Notes in Artificial Intelligence,
2448,
p. 253-260,
2002.
:
Publications in year 2001
Design of Speech Corpus for Text-to-Speech Synthesis .
Eurospeech 2001 - Interspeech, proceedings of the 7th European Conference on Speech Communication and Technology,
p. 2047-2050,
Aalborg, Denmark,
2001.
:
Voice of America (VOA) Broadcast News Czech Transcript Corpus .
Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia in Pilsen (distribution rights transferred to Linguistic Data Consortium, University of Pe,
2001.
:
Publications in year 2000
ARTIC: A New Czech Text-To-Speech System Using Statistical Approach to Speech Segment Database Construction .
Interspeech 2000 - ICSLP, proceedings of the sixth International Conference on Spoken Language Processing,
vol. 4,
p. 612-615,
Beijing, China,
2000.
:
Recording and Annotation of the Czech Speech Corpus .
Text, Speech and Dialogue, proceedings of the 3rd International Workshop TSD 2000,
Lecture Notes in Artificial Intelligence,
vol. 1902,
p. 319-323,
Springer,
Berlin, Heidelberg,
2000.
:
Classification of Transient Events of Nuclear Reactor Using Hidden Markov Model .
Acta Polytechnica,
3,
vol. 40,
p. 34-38,
ČVUT,
Prague,
2000.
:
Publications in year 1999
Statistical Approach to the Automatic Synthesis of Czech Speech .
Text, Speech and Dialogue, proceedings of the 2nd International Workshop TSD 1999,
Lecture Notes in Artificial Intelligence,
vol. 1672,
p. 376-379,
Springer,
Berlin, Heidelberg,
1999.
:
Publications in year 1997
An Approach to Speaker Identification Using Multiple Classifiers .
1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'97),
vol. II: Speech Processing,
p. 1135-1138,
Institute of Electrical and Electronics Engineers, Inc.,
Munich, Germany,
1997.
:
Publications (no year given)
System for Fast Lexical and Phonetic Spoken Term Detection in a Czech Cultural Heritage Archive .
EURASIP Journal on Audio, Speech, and Music Processing,
[submitted, in review],
Springer-Verlag, GmbH,
Heidelberg, Germany,
.
: