I am a PhD student at CDT in NLP, University of Edinburgh. My advisor is Prof. Mirella Lapata and my research area is natural language processing. My current focus is on discourse processing for conversational semantic parsing. I am interested in investigating how to represent interaction context (i.e., discourse) in tandem with task dependent context, how to learn such representation automatically, and how to effectively use such representation for end task.
Before this, I was a research engineer at IBM Research. At IBM, I worked on structured data to text generation. Our focus was on developing systems which are scalable and controllable. We presented a tutorial at ACL 2019 on this topic. I also worked as a software engineer at Amazon for a year.
I graduated from the Indian Institute of Technology, Hyderabad with an M.Tech in Computer Science and Engineering where I was advised by Prof. Vineeth N Balasubramanian. At IITH, I worked on Metric Learning in a streaming data setup. To avoid heavy computation as data arrives, we proposed unsupervised information theoretic metric learning and an incremental version of diffusion maps. I also worked with Dr TVS Udaya Bhaskar on ocean 🌊 biome classification by learning a distance metric for clustering.
I got my undergraduate degree in Computer Science from Jabalpur Engineering College where I enjoyed building line () following robots and configuring routers.
When I am not in working, I enjoy listening to Indian Classical Music or play violin 🎻.
* indicate equal contribution
Semantic Parsing for Conversational Question Answering over Knowledge Graphs
Perez-Beltrachini, L., Jain, P. , Monti, E., Lapata, M., Dataset
Memory-Based Semantic Parsing
Jain, P., Lapata, M., (TACL) 2021
Bootstrapping Chatbot Interfaces to Databases
Mittal, A., Saha, D., Jain, P., Sen, J., Jammi, M., Sankaranarayanan, K., (8th ACM IKDD CODS and 26th COMAD), 2021
Unified Semantic Parsing with Weak Supervision
Agrawal, P., Jain, P., Dalmiya, A., Bansal, A. Mittal, A., Sankaranarayanan, K. (ACL 2019)
Creation and Interaction with Large-scale Domain-Specific Knowledge Bases
Bharadwaj, S. et al. [and others, including Jain, P.] (VLDB 2017, Demonstrations Track) [Video]
Scalable Micro-planned Generation of Discourse from Structured Data
Laha, A.*, Jain, P.*, Mishra, A.*, Sankaranarayanan, K. (Computational Linguistics Journal), 2019
✨In news: Powering Match Insights for 🎾 US Open by interfacing with structured knowledge bases. TOI Gadgetsnow IBM Article
A Mixed Hierarchical Attention based Encoder-Decoder Approach for Standard Table Summarization
Jain, P., Laha, A., Sankaranarayanan, K., Nema, P., Khapra, M.M. and Shetty, S., (NAACL-HLT 2018)
Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization
Nema, P.*, Jain, P.*, Shetty, S., Laha, A., Sankaranarayanan, K. and Khapra, M.M., (NAACL-HLT 2018) [Video]
Story Generation from Sequence of Independent Short Descriptions
Jain, P., Agrawal, P., Mishra, A., Sukhwani, M., Laha, A. and Sankaranarayanan, K., (SIGKDD Workshop ML4Creativity, 2017)
Unsupervised Controllable Text Formalization
Jain, P., Mishra, A., Azad, A.P. and Sankaranarayanan, K. (AAAI 2019) [Video]
Unsupervised Neural Text Simplification
Surya, S., Mishra, A., Laha, A., Jain, P., Sankaranarayanan, K. (ACL 2019)
Online Active Metric Learning for Clustering
Jain, P., Balasubramanian, V.N. (XRCI OPEN 2016) [Poster]
Metric Learning for Clustering in Streaming Large-Scale Data
Jain, P. (IIT Hyderabad, 2015) Thesis
(Invited Talk) Conversational Semantic Parsing - Online International. Conference. On. Advances in Physical, Mathematical and Computational Sciences, India - 2022
Storytelling from Structured Data and Knowledge Graphs : An NLG Perspective, Tutorial ACL 2019 [Website]
Introduction to Machine Learning - Christ University Bangalore, 2018
Generating Descriptions from Structured Data NAACL 2018 [Video]
Talk on Online Metric learning (IBM Research Bangalore, 2016) [Slides]
Tutor for Natural Language Understanding, Generation, and Machine Translation, University of Edinburgh (2020-2021)
TA for Numerical Linear Algebra for Data Analysis (CS5270, 2015), IIT Hyderabad
TA for Introduction to Database Management Systems (CS3010/CS3011, 2014), IIT Hyderabad
TA for Advanced Compiler Design (CS6240, 2014), IIT Hyderabad