Idiap Research Institute
All publications sorted by journal and type


Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods (2009)

A Kernel Wrapper for Phoneme Sequence Recognition, Joseph Keshet and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009
A Large Margin Algorithm for Forced Alignment, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009
A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, Joseph Keshet, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

Multimodal Corpora: From Models of Natural Interaction to Systems and Applications (2009)

Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, Jonathan Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009

Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods (2009)

Discriminative Keyword Spotting, David Grangier, Joseph Keshet and Samy Bengio, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

Multimodal Signal Processing for Human-Computer Interaction (2009)

Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009

J.-P. Thiran, H. Bourlard, and F. Marques (Eds.), Multimodal Signal Processing, Academic Press (2009)

Modeling interest in face-to-face conversations from multimodal nonverbal behavior, Daniel Gatica-Perez, in: J.-P. Thiran, H. Bourlard, and F. Marques (Eds.), Multimodal Signal Processing, Academic Press, 2009

Multi-camera networks: principles and applications (2009)

Multi-Person Bayesian Tracking with Multiple Cameras, Jian Yao and Jean-Marc Odobez, in: Multi-camera networks: principles and applications, pages 363-388, Academic Press, 2009

Applied Signal Processing--A MATLAB approach (2008)

How does a dictation machine recognize speech?, T. Dutoit, L. Couvreur and Hervé Bourlard, in: Applied Signal Processing--A MATLAB approach, Springer MA, 2008

Machine Learning for Multimodal Interaction IV (2008)


Towards Brain-Computer Interfacing (2007)

Error-Related EEG Potentials in Brain-Computer Interfaces, Pierre W. Ferrez and José del R. Millán, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

European Visions for the Knowledge Age (2007)

Tapping the Mind or Resonating Minds?, José del R. Millán, in: European Visions for the Knowledge Age, Cheshire Henbury, 2007

2006 IMIA Yearbook of Medical Informatics (2006)

Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006

The Handbook of Brain Theory and Neural Networks: The Second Edition (2002)

Brain-Computer Interfaces, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002
Hidden Markov Models and other Finite State Automata for Sequence Processing, Hervé Bourlard and Samy Bengio, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002
Robot Navigation, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002

Mathematical Foundations of Speech Processing and Recognition (2002)

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, in: Mathematical Foundations of Speech Processing and Recognition, Springer-Verlag, 2002

Speech Processing in the Auditory System (2000)

Automatic Speech Recognition: an Auditory Perspective, Nelson Morgan, Hervé Bourlard and Hynek Hermansky, in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000

to be published in The Handbook of Brain Theory and Neural Networks (2000)

Neural Networks in Automatic Speech Recognition, F. Beaufays, Hervé Bourlard, H. Franco and Nelson Morgan, in: to be published in The Handbook of Brain Theory and Neural Networks, Bradford Books, The MIT Press, 2000

Kohonen Maps (1999)


Modern Interface Technology: The Leading Edge (1999)

Speech Reading, Juergen Luettin, in: Modern Interface Technology: The Leading Edge, Research Studies Press Ltd., 1999

Survey of the State of the Art in Human Language Technology (1998)

Connectionist Techniques, Hervé Bourlard and Nelson Morgan, in: Survey of the State of the Art in Human Language Technology, Cambridge University Press, 1998

Adaptive Processing of Sequences and Data Structures (1998)

Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, Hervé Bourlard and Nelson Morgan, in: Adaptive Processing of Sequences and Data Structures, Springer Verlag, 1998

Optical Metrology (1997)

Ellipsometry, Indu Saxena, in: Optical Metrology, Artech House, 1997

Handbook of Neural Computation (1997)

Neural Network Adaptations to Hardware Implementations, Perry Moerland and Emile Fiesler, in: Handbook of Neural Computation, Institute of Physics Publishing and Oxford University Publishing, 1997

Speechreading by Humans and Machines (1996)

Active Shape Models for Visual Speech Feature Extraction, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Speechreading by Humans and Machines, Springer Verlag, 1996
Machine Recognition and Applications, Juergen Luettin, Michael Vogt and Christoph Bregler, in: Speechreading by Humans and Machines, Springer Verlag, 1996

Handbook of Neural Computation (1996)

Neural Network Topologies, Emile Fiesler, in: Handbook of Neural Computation, 1996

Fondements et perspectives en traitement automatique de la parole (1996)

Reconnaissance et compréhension de la parole: évaluation et applications, F. Néel, Gérard Chollet, F. Lamel, W. Minker and Andrei Constantinescu, in: Fondements et perspectives en traitement automatique de la parole, AUPELF -- UREF, 1996

Handbook of Neural Computation (1996)

Supervised Ontogenic Networks, Emile Fiesler and K. Cios, in: Handbook of Neural Computation, 1996

The handbook of brain theory and neural networks (1995)

A Hybrid Approach to Continuous Speech Recognition, Kari Torkkola and Teuvo Kohonen, in: The handbook of brain theory and neural networks, The MIT Press, 1995

From Natural to Artificial Neural Computation (1995)

An All-Optical Forward Propagation Multilayer Neural Network, Indu Saxena and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995

Recent Developments in Computer Vision (1995)

Applying Handwriting Recognition to US Census Forms, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995

Spoken Language Resources and Assessment (1995)

Assessment of speaker verification systems, Gérard Chollet and Frédéric Bimbot, in: Spoken Language Resources and Assessment, EAGLES Handbook, 1995

Recent Developments in Computer Vision (1995)

Handwriting Recognition, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995

Fondements et perspectives en traitement automatique de la parole (1995)

Les domaines d'application des technologies vocales, Gérard Chollet, in: Fondements et perspectives en traitement automatique de la parole, GDR-PRC Communication Homme-Machine, 1995

From Natural to Artificial Neural Computation (1995)

Neural Network Initialization, Georg Thimm and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995

The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (2024)

A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

ICASSP (2024)


Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024 (2024)

Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition, Mrinmoy Bhattacharjee, Iuliia Nigmatulina, Amrutha Prasad, Pradeep Rangappa, Srikanth Madikeri, Petr Motlicek, Hartmut Helmke and Matthias Kleinert, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024

Proceedings of the 6th Clinical Natural Language Processing Workshop (2024)


49th IEEE International Conference on Acoustics, Speech and Signal Processing (2024)

Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, Behrooz Razeghi, Parsa Rahimi and Sébastien Marcel, in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024

The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024) (2024)

Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, Xin Quan, Marco Valentino, Louise A Dennis and Andre Freitas, in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (2024)

Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, Julia Rozanova, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (2024)

Face Recognition Using Lensless Camera, Hatef Otroshi Shahreza, Alexandre Veuthey and Sébastien Marcel, in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024
Face Reconstruction from Partially Leaked Facial Embeddings, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) (2024)
