Photo of Prasenjit Mitra


413E Eric J. Barron Innovation Hub
State College, PA 16801
(814) 865-2379
Ph.D., Electrical Engineering Stanford University, California, 2004
M.S., Computer Science The University of Texas at Austin, Texas, 1994
B.Tech., (Honours), Computer Science and Engineering, India Indian Institute of Technology, Kharagpur, 1993

Prasenjit Mitra is a professor in the College of Information Sciences and Technology in the area of Data Sciences and Artificial Intelligence. His current research interests are in the areas of applied machine learning, natural language processing, big data analytics, and visual analytics especially in application areas such as social media analytics, digital libraries, medical informatics, wildlife informatics, sports analytics, etc. In the past, he has contributed to the areas of data interoperation, data cleaning, and digital libraries especially in tabular data extraction, information retrieval, and citation recommendation.

Mitra received his Ph.D. from Stanford University in 2004, where he investigated issues related to modeling data and the semantics of data in an information integration system. At Penn State, he has pursued research on a broad range of topics ranging from artificial intelligence and machine learning to data mining on the web and social media, scalable data cleaning, political text mining, chemical formula and name extraction from documents, and the extraction of data and metadata from figures and tables in digital documents.

He was the principal investigator of the DOES project funded by the NSF CAREER Award. He has also been the co-principal investigator of the CiteSeerX, ChemXSeer, and ArchSeer digital library projects, the Regional Visualization and Analytics Center (NEVAC), and the GeoCAM visual analytics projects. Mitra serves as the director of the Cancer Informatics Initiative at Penn State. His research has been supported by the NSF, Microsoft Corporation, DoD, DHS, DoE, NGA, and DTRA.

Mitra obtained his Bachelor of Technology, with honors from the Indian Institute of Technology, Kharagpur in 1993. In 1994, he obtained an M.S. in Computer Science from The University of Texas at Austin. From 1995 to 2000, he worked at the Server Technologies Division at Oracle Corporation as a Senior Member on the Oracle Parallel Server in the Languages and Relational Technologies group. He has served as a consultant for several startups including the Board of Advisors of Global IDs, Inc.

Mitra has co-authored approximately 150 articles at top conferences and journals. His work along with his co-authors has resulted in a visual analytics system that was awarded the IEEE VAST '08 Grand Challenge award in the Data Integration area. He has served as the co-chair of the IEEE SOCIETY conference, and as an area chair, and a senior program committee member at top conferences such as CIKM, and IJCAI, respectively. Mitra has been a member of the Best Paper Award committee for CIKM'15 and the co-chair of four workshops including SNAKDD'09, WIDM'09, and WIDM'12. He has also served on the program committee of several top conferences including SIGMOD, VLDB, AAAI, IJCAI, WWW, CIKM, WSDM, KDD, and ICDM, and serves on the editorial board of the Journal of Data Mining and Digital Humanities. He has supervised over 15 Ph.D. students; and several M.S. students. 

Research Keywords
Artificial Intelligence
Applied Machine Learning
Natural Language Processing
Big Data Analytics
Data Science
Information Retrieval
Visual Analytics
Digital Libraries