Publication list - Idiap Publications

A Kernel Wrapper for Phoneme Sequence Recognition, Joseph Keshet and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

A Large Margin Algorithm for Forced Alignment, Joseph Keshet, Shai Shalev-Shwartz, Yoram Singer and Dan Chazan, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, Joseph Keshet, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

Accessing a Large Multimodal Corpus using an Automatic Content Linking Device, Andrei Popescu-Belis, Jean Carletta, Jonathan Kilgour and Peter Poller, in: Multimodal Corpora: From Models of Natural Interaction to Systems and Applications, Springer-Verlag, 2009

Discriminative Keyword Spotting, David Grangier, Joseph Keshet and Samy Bengio, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009

Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions, Andrei Popescu-Belis, in: Multimodal Signal Processing for Human-Computer Interaction, Elsevier / Academic Press, 2009

Modeling interest in face-to-face conversations from multimodal nonverbal behavior, Daniel Gatica-Perez, in: J.-P. Thiran, H. Bourlard, and F. Marques (Eds.), Multimodal Signal Processing, Academic Press, 2009

Multi-Person Bayesian Tracking with Multiple Cameras, Jian Yao and Jean-Marc Odobez, in: Multi-camera networks: principles and applications, pages 363-388, Academic Press, 2009

How does a dictation machine recognize speech?, T. Dutoit, L. Couvreur and Hervé Bourlard, in: Applied Signal Processing--A MATLAB approach, Springer MA, 2008

Towards an Objective Test for Meeting Browsers: the BET4TQB Pilot Experiment, Andrei Popescu-Belis, Philippe Baudrion, Mike Flynn and Pierre Wellner, in: Machine Learning for Multimodal Interaction IV, Springer-Verlag, 2008

Adaptation in Brain-Computer Interfaces, José del R. Millán, Anna Buttfield, C. Vidaurre, M. Krauledat, A. Schlögl, P. Shenoy, B. Blankertz, R.P.N. Rao, R. Cabeza, Gert Pfurtscheller and K.-R. Müller, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

Error-Related EEG Potentials in Brain-Computer Interfaces, Pierre W. Ferrez and José del R. Millán, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

Non-Invasive Estimates of Local Field Potentials for Brain-Computer Interfaces, R. Grave de Peralta Menendez, S. L. González Andino, Pierre W. Ferrez and José del R. Millán, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

Tapping the Mind or Resonating Minds?, José del R. Millán, in: European Visions for the Knowledge Age, Cheshire Henbury, 2007

The IDIAP Brain-Computer Interface: An Asynchronous Multi-Class Approach, José del R. Millán, Pierre W. Ferrez and Anna Buttfield, in: Towards Brain-Computer Interfacing, The MIT Press, 2007

Non-Invasive Brain-Actuated Control of a Mobile Robot by Human EEG, José del R. Millán, F. Renkens, J. Mouriño and W. Gerstner, in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006

Brain-Computer Interfaces, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002

Hidden Markov Models and other Finite State Automata for Sequence Processing, Hervé Bourlard and Samy Bengio, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002

Robot Navigation, José del R. Millán, in: The Handbook of Brain Theory and Neural Networks: The Second Edition, The MIT Press, 2002

Towards Robust and Adaptive Speech Recognition Models, Hervé Bourlard, Samy Bengio and Katrin Weber, in: Mathematical Foundations of Speech Processing and Recognition, Springer-Verlag, 2002

Automatic Speech Recognition: an Auditory Perspective, Nelson Morgan, Hervé Bourlard and Hynek Hermansky, in: Speech Processing in the Auditory System, Springer Verlag, New York, 2000

Neural Networks in Automatic Speech Recognition, F. Beaufays, Hervé Bourlard, H. Franco and Nelson Morgan, in: to be published in The Handbook of Brain Theory and Neural Networks, Bradford Books, The MIT Press, 2000

Indexing Audio Documents by using Latent Semantic Analysis and SOM, Mikko Kurimo, in: Kohonen Maps, Elsevier, 1999

Speech Reading, Juergen Luettin, in: Modern Interface Technology: The Leading Edge, Research Studies Press Ltd., 1999

Connectionist Techniques, Hervé Bourlard and Nelson Morgan, in: Survey of the State of the Art in Human Language Technology, Cambridge University Press, 1998

Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions, Hervé Bourlard and Nelson Morgan, in: Adaptive Processing of Sequences and Data Structures, Springer Verlag, 1998

Ellipsometry, Indu Saxena, in: Optical Metrology, Artech House, 1997

Neural Network Adaptations to Hardware Implementations, Perry Moerland and Emile Fiesler, in: Handbook of Neural Computation, Institute of Physics Publishing and Oxford University Publishing, 1997

Active Shape Models for Visual Speech Feature Extraction, Juergen Luettin, Neil A. Thacker and Steve W. Beet, in: Speechreading by Humans and Machines, Springer Verlag, 1996

Machine Recognition and Applications, Juergen Luettin, Michael Vogt and Christoph Bregler, in: Speechreading by Humans and Machines, Springer Verlag, 1996

Neural Network Topologies, Emile Fiesler, in: Handbook of Neural Computation, 1996

Reconnaissance et compréhension de la parole: évaluation et applications, F. Néel, Gérard Chollet, F. Lamel, W. Minker and Andrei Constantinescu, in: Fondements et perspectives en traitement automatique de la parole, AUPELF -- UREF, 1996

Supervised Ontogenic Networks, Emile Fiesler and K. Cios, in: Handbook of Neural Computation, 1996

A Hybrid Approach to Continuous Speech Recognition, Kari Torkkola and Teuvo Kohonen, in: The handbook of brain theory and neural networks, The MIT Press, 1995

An All-Optical Forward Propagation Multilayer Neural Network, Indu Saxena and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995

Applying Handwriting Recognition to US Census Forms, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995

Assessment of speaker verification systems, Gérard Chollet and Frédéric Bimbot, in: Spoken Language Ressources and Assessment, EAGLES Handbook, 1995

Handwriting Recognition, Thomas M. Breuel, in: Recent Developments in Computer Vision, Springer, 1995

Les domaines d'application des technologies vocales, Gérard Chollet, in: Fondements et perspectives en traitement automatique de la parole, GDR-PRC Communication Homme-Machine, 1995

Neural Network Initialization, Georg Thimm and Emile Fiesler, in: From Natural to Artificial Neural Computation, Springer Verlag, 1995

A Differentiable Integer Linear Programming Solver for Explanation-Based Natural Language Inference, Mokanarangan Thayaparan, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

Content-Based Objective Evaluation of Artificially Generated Sign Language Videos, Neha Tarigopula, Preyas Garg, Skanda Muralidhar, Sandrine Tornay, Dinesh Babu Jayagopi and Mathew Magimai.-Doss, in: ICASSP, 2024

Contextual Biasing Methods for Improving Rare Word Detection in Automatic Speech Recognition, Mrinmoy Bhattacharjee, Iuliia Nigmatulina, Amrutha Prasad, Pradeep Rangappa, Srikanth Madikeri, Petr Motlicek, Hartmut Helmke and Matthias Kleinert, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP) 2024, Seoul, Korea, 2024

DAIC-WOZ: On the Validity of Using the Therapist's prompts in Automatic Depression Detection from Clinical Interviews, Sergio Burdisso, Ernesto A. Reyes-Ramírez, Esaú Villatoro-Tello, Fernando Sánchez-Vega, A. Pastor López-Monroy and Petr Motlicek, in: Proceedings of the 6th Clinical Natural Language Processing Workshop, Association for Computational Linguistics, 2024

Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition, Behrooz Razeghi, Parsa Rahimi and Sébastien Marcel, in: 49th IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, 2024

Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement, Xin Quan, Marco Valentino, Louise A Dennis and Andre Freitas, in: The 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024), 2024

Estimating the Causal Effects of Natural Logic Features in Transformer-Based NLI Models, Julia Rozanova, Marco Valentino and Andre Freitas, in: The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, 2024

Face Recognition Using Lensless Camera, Hatef Otroshi Shahreza, Alexandre Veuthey and Sébastien Marcel, in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

Face Reconstruction from Partially Leaked Facial Embeddings, Hatef Otroshi Shahreza and Sébastien Marcel, in: Proceedings of the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2024

Fine-tuning Self-Supervised Models For Language Identification Using Orthonormal Constraint, Amrutha Prasad, Andrés Carofilis, Geoffroy Vanderreydt, Driss Khalil, Srikanth Madikeri, Petr Motlicek and Christof Schüpbach, in: Proceedings of the 49th IEEE International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024
