SJ Cox CV and Publications

Stephen Cox's Research Information Page

S.J. Cox Research CV (PDF)

Publications by S.J. Cox

2016

Visual Units and Confusion Modelling for Automatic Lip-reading
Dominic Howell, Stephen Cox and Barry Theobald
Image and Vision Computing, 2016, pp 1–12 DOI: 10.1016/j.imavis.2016.03.003

Bi-Text Alignment of Movie Subtitles for Spoken English-Arabic Statistical Machine Translation
Fahad Al-Obaidli, Stephen Cox and Preslav Nakov
CICLing 2016, 17th International Conference on Intelligent Text Processing and Computational Linguistics, April 2016, Konya, Turkey.

Improved Speaker Independent Lip Reading using Speaker Adaptive Training and Deep Neural Networks
Ibrahim Almajai, Stephen Cox, Richard Harvey, Yuxuan Lan
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Shanghai, 2016

2015

Speaker-independent machine lip-reading with speaker-dependent viseme classifiers
Helen Bear, Richard Harvey, Stephen Cox
Proc. FAAVSP, 1st Joint Conference on Facial Analysis, Animation and Audio-Visual Speech Processing, September 2015, Vienna

Discovering Patterns in Visual Speech
Stephen Cox
Proc. FAAVSP, 1st Joint Conference on Facial Analysis, Animation and Audio-Visual Speech Processing, September 2015, Vienna

Improving Lip-Reading Performance For Robust Audiovisual Speech Recognition Using Deep Neural Networks
Kwanchiva Thangthai, Richard Harvey, Stephen Cox, Barry-John Theobald
Proc. FAAVSP, 1st Joint Conference on Facial Analysis, Animation and Audio-Visual Speech Processing, September 2015, Vienna

Detection of Anomalous Events in a Tennis Game Using Multimodal Information
Qiang Huang and Stephen Cox
Proc. Asia-Pacific Signal and Information Processing Association (APSIPA), Hong Kong, December 2015

2014

Automatic annotation of tennis games: An integration of audio, vision, and learning
Fei Yan, Josef Kittler, David Windridge, William Christmas, Krystian Mikolajczyk, Stephen Cox, Qiang Huang
Image and Vision Computing, 32 (2014) 896–903.

Unsupervised Model Selection for Recognition of Regional Accented Speech
Maryam Najafian, Andrea DeMarco, Stephen Cox, Martin Russell
Proc. 17th International Conference on Spoken Language Processing (Interspeech), Singapore, September 2014

2013

Recent developments in automated lip-reading
R. Bowden, S. Cox, R. Harvey, Y. Lan, E.J. Ong, G. Owen and B. Theobald
SPIE Security and Defence 2013, Dresden, September 2013.

Confusion Modelling for Automated Lip-Reading using Weighted Finite-State Transducers
D. Howell, B. Theobald and S. Cox
12th International Conference on Auditory-Visual Speech Processing (AVSP) 2013

Native Accent Classification via I-Vectors and Speaker Compensation Fusion
A. DeMarco and S. Cox
Proc. 16th International Conference on Spoken Language Processing (Interspeech), Lyon, August 2013

A Two Layered Data Association Approach For Ball Tracking
X. Zhou, Q. Huang, L. Xie and S. Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Vancouver, 2013

2012

Iterative Classification of Regional British Accents via I-Vector Space
A. DeMarco and S. Cox
Symposium on Machine Learning in Speech and Language Processing (MLSLP), September 2012, Portland, Oregon, USA

Is automated conversion of video to text a reality?
R. Bowden, S. Cox, R. Harvey, Y. Lan, Eng-Jon Ong, G. Owen and B. Theobald
SPIE Security and Defence, Edinburgh, September 2012

Improved Audio Event Detection by Use of Contextual Noise
Q. Huang and S. Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Kyoto, 2012

Detection of Ball Hits in a Tennis Game Using Audio and Visual Information
Q. Huang, S. Cox, X. Zhou and L. Xie
Proc. Asia-Pacific Signal and Information Processing Association (APSIPA), Hollywood, December 2012

2011

An Accurate and Robust Gender Identification Algorithm
Andrea DeMarco and Stephen J. Cox
Proc. 14th International Conference on Spoken Language Processing (Interspeech), Florence, August 2011

Iterative Improvement of Speaker Segmentation using High-level Knowledge
Qiang Huang and Stephen Cox
Proc. 14th International Conference on Spoken Language Processing (Interspeech), Florence, August 2011

Learning Score Structure from Spoken Language for A Tennis Game
Qiang Huang and Stephen Cox
Proc. 14th International Conference on Spoken Language Processing (Interspeech), Florence, August 2011

Improved Detection of Ball Hit Events in a Tennis Game Using Multimodal Information
Qiang Huang, Stephen Cox, Fei Yan, Teo de Campos, David Windridge, Josef Kittler, William Christmas
Proc. International Conference on Auditory-Visual Speech Processing (AVSP) 2011

Inferring the Structure of a Tennis Game using Audio Information
Qiang Huang and Stephen Cox
IEEE Transactions on Audio, Speech & Language Processing, Vol. 19 No 7, pp. 1925–1937, September 2011.

2010

Shallow Parsing of a Tennis Game from Audio Events
Qiang Huang and Stephen Cox
Fourth International Conference on Intelligent Information Technology Applications (IITA 2010), Qinhuangdao, China, November 5 - 7, 2010

Limits of Visual Speech Recognition
J. L. Newman, B. J. Theobald and Stephen J. Cox
Proc. International Conference on Auditory-Visual Speech Processing 2010, Hakone, Kanagawa, Japan

Using High-level Information to Detect Key Audio Events in a Tennis Game
Qiang Huang and Stephen Cox
Proc. 13th International Conference on Spoken Language Processing (Interspeech), Makuhari, September 2010

Hierarchical Language Modeling for Audio Events Detection in a Sports Game
Qiang Huang and Stephen Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Dallas, 2010

Speaker Independent Visual-Only Language Identification
Jacob Newman and Stephen Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Dallas, 2010

2009

Example-Based Speech Recognition using Formulaic Phrases
Christopher Watkins and Stephen J. Cox
Proc. 12th International Conference on Spoken Language Processing (Interspeech), Brighton, September 2009

On the Estimation and the Use of Confusion-Matrices for Improving ASR Accuracy
Santiago Omar Caballero Morales and Stephen J. Cox
Proc. 12th International Conference on Spoken Language Processing (Interspeech), Brighton, September 2009

Deriving Cultural Representations of Features for Audio Music Similarity Estimation
K. West and S. Cox
IEEE Transactions on Audio, Speech & Language Processing, Vol 18 no 3, March 2010, pp 625--637

Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers
Santiago Omar Caballero Morales and Stephen J. Cox
EURASIP Journal on Advances in Signal Processing, Volume 2009 (2009), Article ID 308340, doi:10.1155/2009/308340

Automatic Visual-Only Language Identification: A Preliminary Study
J. Newman and S. Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Taiwan, 2009.

2008

The challenge of multispeaker lip-reading
S. Cox, R. Harvey, Y. Lan, and B.J. Theobald
Proc. International Conference on Auditory-Visual Speech Processing 2008, Tangalooma, Australia

Application of Weighted Finite-State Transducers to Improve Recognition Accuracy for Dysarthric Speech
O. Caballero-Morales and S.J. Cox
Proc. 11th International Conference on Spoken Language Processing (Interspeech), Brisbane, September 2008

On Estimation of Speakers' Confusion Matrices from Sparse Data
S.J. Cox
Proc. 11th International Conference on Spoken Language Processing (Interspeech), Brisbane, September 2008

2007

Automatic Pitch Accent Prediction for Text-To-Speech Synthesis
Ian Read and Stephen Cox
Proc. 10th International Conference on Spoken Language Processing (Interspeech), Antwerp, August 2007

Modelling Confusion-Matrices to Improve Speech Recognition Accuracy, with an Application to Dysarthric Speech
O. Caballero-Morales and S.J. Cox
Proc. 10th International Conference on Spoken Language Processing (Interspeech), Antwerp, August 2007

Analysis of user interaction with service oriented chatbot systems
M.C. Jenkins, R. Churchill, S.J. Cox and Dan Smith
Proc. HCI International 2007, Beijing

VANESSA -- A system for communication between deaf and hearing people
J.R.W. Glauert, R. Elliott, S.J. Cox, J. Tryggvason and M. Sheard
Technology and Disability, Vol 18, no 4, pp 207-216, 2007.

Stochastic and Syntactic Techniques for Predicting Phrase Breaks
I. Read and S.J. Cox
Computer Speech and Language, Volume 21, Issue 3, pp 519--542, July 2007.

2006

Task-Independent Call-Routing
Q. Huang and S.J. Cox
Speech Communication Vol. 48, Issues 3--4, March--April 2006, Pages 374--389.

Incorporating Machine Learning into Music Similarity Estimation
K. West, S. Cox and P. Lamere
Proceedings of the 1st ACM workshop on Audio and music computing multimedia, pp 89--96, Santa Barbara, California, USA, October 2006

Lip-reading enhancement for law enforcement
B. Theobald, R. Harvey, S. Cox, G. Owen, and C. Lewis
Proc. SPIE conference on Optics and Photonics for Counterterrorism and Crime Fighting, G. Owen and C. Lewis, Eds., vol. 6402, September 2006, pp. 205--9.

Computer technology in the analysis and development of the "Singer's Formant" in adult singing students
J. Davies and S.J. Cox
Proc. 3rd Physiology and Acoustics of Singing Conference (PAS3-06), York, May 2006

2005

VANESSA - A System for Communication between Deaf and Hearing People
JRW Glauert, R Elliott, SJ Cox, J Tryggvason, and M Sheard
Proceedings of the 8th Conference of the Association for the Advancement of Assistive Technology in Europe (AAATE 2005), Chapter 24 pp 557-562, Lille 2005.

A study of computer-aided analysis in the training of singers
J. Davies, M. Wildman and S.J. Cox
Proc. 6th Pan European Voice Conference, PEVOC 6, London, September 2005

Stochastic and Syntactic Techniques for Predicting Phrase Breaks
I. Read and S.J. Cox
Proc. 9th European Conference on Speech Communication and Technology, Lisbon, September 2005

A Discriminative Approach to Phrase Break Modelling
S.J. Cox
Proc. 9th European Conference on Speech Communication and Technology, Lisbon, September 2005

Finding an Optimal Segmentation for Audio Genre Classification
K. West and S. Cox
Proc. 6th International Conference on Music Information Retrieval (ISMIR 2005), London, September 2005

2004

Modelling of Confusions in Aircraft Call Signs
S.J. Cox and L. Vinagre
Speech Communication Vol 42, Nos 3-4, pages 289--312, April 2004.

Automatic Call Routing with Multiple Mixture Language Models
Q. Huang and S.J. Cox
Proc. Int. Conf. on Spoken Language Processing, Korea, October 2004

Using Part-Of-Speech Tags For Predicting Phrase Breaks
I. Read and S.J. Cox
Proc. Int. Conf. on Spoken Language Processing, Korea, October 2004

Using Context to Correct Phone Recognition Errors
S.J. Cox
Proc. Int. Conf. on Spoken Language Processing, Korea, October 2004

Features and Classifiers for the Automatic Classification of Musical Audio Signals
K. West and S. Cox
Proc. 5th International Conference on Music Information Retrieval, (ISMIR 2004) Barcelona, October 2004

Can we modify existing automatic speech recognition technology to reliably and safely monitor respiration in patients sedated with Propofol?
L. Tan, S.J. Cox, G.D. Bell and M. Mansfield
British Society of Gastroenterology Annual Meeting, March, 2004, Glasgow

Automatic Call Routing with Multiple Language Models
Q. Huang and S.J. Cox
Human Language Technology / North American Chapter of the Association for Computational Linguistics annual meeting: Proc. Workshop on Spoken Language Understanding for Conversational Systems, Boston, 2004

Dual Systems Processing and Translation at the Post Office: Reading the Signs
A. Wray, S.J. Cox, M. Lincoln and J. Tryggvason
Language & Communication, Vol. 24, no 1, pages 59--75, January 2004

Improving Phoneme Recognition of Telephone Quality Speech
Q. Huang and S.J. Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Montreal, May 2004.

2003

Unit Selection in Concatenative Text-to-Speech Synthesis System Based on Concatenation Costs Derived From Mel Filter Bank Amplitudes
T. Lambert, A. Breen, S. Cox and B. Milner
Proc. 8th European Conf. on Speech Communication and Technology, Geneva, September 2003

Call-Routing without Transcriptions
Q. Huang and S.J. Cox
Proc. 8th European Conf. on Speech Communication and Technology, Geneva, September 2003

Integrated Pitch and MFCC Extraction for Speech Reconstruction and Speech Recognition Applications
Xu Shao, Ben Milner and S.J. Cox
Proc. 8th European Conf. on Speech Communication and Technology, Geneva, September 2003

The Use of Confidence Measures in Vector Based Call-Routing
S.J. Cox and G.C. Cawley
Proc. 8th European Conf. on Speech Communication and Technology, Geneva, September 2003

Discriminative Techniques in Call Routing
S.J. Cox
IEEE Conf. on Acoustics, Speech and Signal Processing, Hong Kong, 2003

A Comparison of Language Processing Techniques for a Constrained Speech Translation System
M. Lincoln and S.J. Cox
IEEE Conf. on Acoustics, Speech and Signal Processing, Hong Kong, 2003

The Development and Evaluation of a Speech to Sign Translation System to Assist Transactions
S.J. Cox, M. Lincoln, M.J. Nakisa, M. Wells, M. Tutt and S. Abbott
Int. Journal of Human Computer Interaction, Vol 16 No 2, pages 141--161, October 2003.

2002

Speech and Language Processing for a Constrained Speech Translation System
S.J. Cox
Proc. Int. Conf. on Spoken Language Processing, Denver, September 2002

High-level Approaches to Confidence Estimation in Speech Recognition
S. Cox and S. Dasmahapatra.
IEEE Transactions on Speech and Audio, Vol 10, No 7, November 2002, pages 460--471

Extraction of Visual Features for Lipreading
I.A. Matthews, T. Cootes, J.A. Bangham, S.J. Cox and R.W. Harvey
IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol 24, No 2, pages 198--213, February 2002

TESSA, a system to aid communication with deaf people
S.J. Cox, M. Lincoln, J. Tryggvason, M. Nakisa, M. Wells, M. Tutt and S. Abbott
Proc. ASSETS 2002, Fifth International ACM SIGCAPH Conference on Assistive Technologies, pages 205--212, July 2002, Edinburgh, Scotland

Approaches to English to Sign Translation
S. Cox, E. Safar and I Marshall
The Linguist, February/March 2002, Vol. 41 No 1, pages 6--10

2001

A Comparison of some Different Techniques for Vector Based Call-Routing
S. Cox and B. Shahshahani
Proc. 7th European Conf. on Speech Communication and Technology, Aalborg, September 2001

Improved techniques for automatic call-routing
S. Cox and B. Shahshahani
Institute of Acoustics Workshop on Innovation in Speech Processing (WISP-2001), Stratford-upon-Avon, April 2001

2000

A Semantically-Based Confidence Measure for Speech Recognition
S. Cox and S. Dasmahapatra.
Proc. Int. Conf. On Spoken Language Processing, Beijing, 2000

Speaker Normalization in the MFCC Domain
S. Cox.
Proc. Int. Conf. On Spoken Language Processing, Beijing, 2000

Virtual Signing: Capture, Animation, Storage and Transmission -- an Overview of the ViSiCAST Project
JA Bangham, SJ Cox, R Elliott, JRW Glauert, I Marshall, S. Rankov and M. Wells
Proc. IEE Colloquium on Speech and Language processing for the Disabled and Elderly, April 2000, pages 6/1--6/7

Signing for the Deaf using Virtual Humans
JA Bangham, SJ Cox, M. Lincoln, I Marshall, M. Tutt and M. Wells.
Proc. IEE Colloquium on Speech and Language processing for the Disabled and Elderly, April 2000

Meta-Models for Confidence Estimation in Speech Recognition
S. Dasmahapatra and S. Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Istanbul, 2000

1999

A Fast Method of Channel Equalisation for Speech Signals and its Implementation on a DSP
B. Theobald, S. Cox, G. Cawley and B. Milner
IEE Electronics Letters, Vol 35 No 16, August 1999, pages 1309--1311

A High-level Approach to Confidence Estimation in Speech Recognition
S. Cox and S. Dasmahapatra.
Proc. 6th European Conf. on Speech Communication and Technology, Budapest, September 1999, pages 41--44

Run-length Distributions of Recursive Median Filters using Probabilistic Automata
O. Yli-Harja, I. Schmulevich, J. A. Bangham, R. Harvey, S. Dasmahapatra and S. Cox.
Proc. Scandinavian Conference on Image Analysis, SCIA'99, Kangerlussuaq, Greenland, June 1999

1998

Lip-reading using shape and scale
I.A. Matthews, J.A. Bangham, R.W. Harvey and S.J. Cox
Proc. Int. Conf. on Auditory-Visual Speech Processing (AVSP'98), Terrigal, Australia, December 1998, pages 73-78

Non-linear Scale Decomposition Based Features for Visual Speech Recognition
I.A. Matthews, J.A. Bangham, R.W. Harvey and S.J. Cox
Proc. European Conference on Signal Processing (EUSIPCO), September 1998, pages 303-306

Techniques for accurate automatic annotation of speech waveforms
S.J. Cox, R. Brady and P. Jackson
Proc. Int. Conf. on Spoken Language Processing, Sydney, November 1998, pages 1947-1950

Accent Identification using a Phonotactic Model
M. Lincoln, S. Cox and S. Ringland
Proc. Int. Conf. on Spoken Language Processing, Sydney, November 1998, pages 109-112

Towards Speech Recogniser Assessment Using A Human Reference Standard
S.J. Cox, P.W.Linford, W.B. Hill and R.D.Johnston
Computer Speech and Language, Vol 12, no 4, pages 375-391, October 1998

A Comparison of Active Shape Model and Scale Decomposition Based Features for Visual Speech Recognition
I.A. Matthews, J.A. Bangham, R.W. Harvey and S.J. Cox
Proc. IEEE Computer Vision and Pattern Recognition Conference, Freiburg, May 1998

1997

Combining noise compensation and visual information in speech recognition
S.J. Cox, I.A Matthews and J.A. Bangham
Proceedings of the ESCA/ESCOP Workshop on Audio-Visual Speech Processing, Rhodes, September 1997, pages 53-56.

Evaluating feature set performance using the F-ratio and J-measures
S. Nicholson, B. Milner and S. Cox.
Proc. 5th European Conf. on Speech Communication and Technology, Rhodes, September 1997, pages 413-416.

A fast method of speaker normalisation using formant estimation
M. Lincoln, S. Cox and S. Ringland
Proc. 5th European Conf. on Speech Communication and Technology, Rhodes, September 1997, pages 2095-2098.

Towards a Rating System for Speech Recognisers
S.J.Cox, P.W.Linford, W.B. Hill and R.D.Johnston
Proc. SALT Workshop on Speech Technology Evaluation, University of Sheffield, June 1997.

Lip-reading from scale-space measurements
R.Harvey, I.Matthews, J.A. Bangham and S.J. Cox
Proc. IEEE Computer Vision and Pattern Recognition Conference, Puerto Rico, 1997

1996

Scale-based Features for AudioVisual Speech Recognition
I.Matthews, J.A. Bangham and S.J. Cox
Proc. IEE Colloquium on Integrated Audio-Visual Processing for Recognition Synthesis and Communication, November 1996, pages 8/1-8/7.

A Reference Standard for Speech Recognisers
S.J.Cox, P.W.Linford, I.Jennings and R.D.Johnston
Proc. Institute of Acoustics Autumn Conference, November 1996, pages 283-291

Audiovisual Speech Recognition using Multiscale Nonlinear Image Decomposition
I.Matthews, J.A. Bangham and S.J. Cox
Proc. Int. Conf. on Spoken Language Processing, Philadelphia, October 1996, pages 38-42

Confidence Measures for the SWITCHBOARD Database
S.J.Cox and R.C.Rose
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Vol 1, pages 511-515, Atlanta, May 1996

1995

A Speaker Adaptation Technique using Linear Regression
S.J.Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Vol 1, pages 700-704, Detroit, May 1995

Predictive Speaker Adaptation in Speech Recognition
S.J.Cox
Computer Speech and Language, January 1995, Vol 9, pages 1-17

1994

TLM Modelling of Thermal Transfer in Heat Exchangers
D.de Cogan, S.J.Cox and K.O Chichlowski
Proc. Eurotherm Seminar no 36, Poitiers, September 1994

Performance of Human Listeners on an Alphabetic Speech Recognition Task
S.J.Cox, P.W.Linford, K.O Chichlowski and R.D.Johnston
Proc. Institute of Acoustics Autumn Conference, November 1994, pages 23-30

1993

Speaker adaptation using a Predictive Model
S.J.Cox
Proc. Eurospeech 93, Berlin, 1993, pages 2283--2286

pre-1993

Speaker adaptation in Speech Recognition using Linear Regression Techniques
S.J.Cox
Electronics Letters, Vol 28, No 22, October 1992, pages 2093--2094

Generation of Mouthshapes for a Talking Head
S.J.Cox and A.Simons
Proc. Institute of Acoustics Conference on Speech and Hearing, Windermere, November 1990

RECNORM: Simultaneous normalisation and classification applied to speech recognition.
S.J.Cox and J.S.Bridle
In Advances in Neural Information Processing Systems, D. Touretzky (ed), Morgan-Kaufmann, San Mateo, 1991.

Simultaneous speaker normalisation and word recognition using neural networks/Bayesian techniques.
S.J.Cox and J.S.Bridle
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Albuquerque, April 1990

Hidden Markov models for automatic speech recognition: theory and application.
S.J.Cox
In Speech and Language Processing, Wheddon and Linggard (eds.), Chapman and Hall 1990

Some statistical issues in the comparison of speech recognition algorithms.
L.Gillick and S.J.Cox
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Glasgow, May 1989, pages 532--535

Unsupervised speaker adaptation by probabilistic spectrum fitting.
S.J.Cox and J.S.Bridle
Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Glasgow, May 1989, pages 294-297