Publications
Publications (and the unpublished)
- When Annotators Disagree: A Principled Approach to Learning with Noisy Labels with Bucarelli Maria Sofia, Purifacto Antonio, Lucas Cassano, Andrea Bacciu, Federico Siciliano, Amin Mantrach, and Fabrizio Silvestri in IEEE Transactions on AI 2026.
- The Majority Vote Paradigm Shift: When Popular Meets Optimal with Bucarelli Maria Sofia, Purifacto Antonio, Andrea Bacciu, Amin Mantrach, and Fabrizio Silvestri in AISTATS 2026.
- ParrotTTS: Text-to-speech synthesis exploiting disentangled self-supervised representations with Neil Shah, Saiteja Kosgi, Vishal Tambrahallia, Neha Sahipjohn, Niranjan Pedanekar, and Vineet Gandhi in EACL 2024. On Arxiv.
- Empathic machines: using intermediate features as levers to emulate emotions in text-to-speech system with Saiteja Kosgi, Sarath Sivaprasad, Niranjan Pedanekar, Vineet Gandhi in NAACL 2022 On ACL.
- Interactive post-editing for verbosity controlled translation with Prabhakar Gupta, Anil Nelakanti, Grant M. Berry, Abhishek Sharma, in COLING 2022. On ACL.
- Adapting neural machine translation for automatic post-editing with Abhishek Sharma, Prabhakar Gupta, in Conference on Machine Translation (WMT) 2021. On ACL.
- Object-level context modeling for scene classification with Context-CNN with Syed Ashar Javed in CVPR Workshsop 2017. On arxiv.
- Structured penalties for log-linear language models with Cedric Archambeau, Julien Mairal, Francis Bach and Guillaume Bouchard, in EMNLP 2013. On ACL. Oral slides.
- Tree learning strategies for large-scale taxonomies with Cedric Archambeau, Francis Bach and Guillaume Bouchard. Draft.
- Generalized linear language models with Cedric Archambeau, Francis Bach and Guillaume Bouchard. Draft.
- Planar scene modeling from quasiconvex subproblems with Visesh Chari, Chetan Jakkoju, C.V. Jawahar in ACCV 2009. On ACM.
- Path planning for visual servoing and navigation using convex optimization with Abdul Hafez, C.V. Jawahar, in the Journal of Robotics and Automation, 2014. On web.
- Path planning approach to visual servoing:convex optimization based solution with Abdul Hafez, C.V. Jawahar, in IROS 2008. On IEEE.
Patents granted
- Salient region detection in digital entertainment content, granted in US.
- Audio-lip movement correlation measurement for dubbed content, granted in US.
- Emotion mismatch detection for autodubs, granted in US to Amazon Technologies.
- Song generation using neural network granted in US to Amazon Technologies.
- Voice content selection for video content, filed in US to Amazon Technologies.
- Automated quality assessment of translations, granted in US to Amazon Technologies.
- Language model with structured penalty, granted in US and EU to Xerox Corp.
- (Filed and pending) Language agnostic song detection and identification, filed in US.