Skip to main content

Center Mission

As part of Cornell’s radical collaborations, the Center for Data Science for Enterprise and Society aims to unify programs and curricula in data science with an initial emphasis on questions grounded in data that are generated by human activity, including computational social science (e.g., sociology and government), the economics/computer science interface, aspects of digital agriculture in the production and management of agriculture, digital platforms supporting urban infrastructure (e.g., the sharing economy), and as a theme that is cross-cutting in many of these areas, the corresponding issues of privacy, security, and fairness; more generally, the Center enhances other programmatic areas associated with data science in an entrepreneurial and opportunistic fashion.

CALL FOR PROPOSALS Microsoft Azure Cloud Computing Grant – final call for 23/24 academic year. $50K in credits available to Cornell faculty to be used through June 30th. Apply by April 15. https://datasciencecenter.cornell.edu/microsoft-azure-credits/  

This talk is co-sponsored by the Department of Statistics and Data Science and the Center for Data Science for Enterprise and Society. Abstract: In this talk we will describe two applications of empirical Bayes (EB) methodology. EB procedures estimate the prior probability distribution in a latent variable model or Bayesian model from the data. In the first part we study the (Gaussian) signal plus noise model with multivariate, heteroscedastic errors. This model arises in many large-scale denoising problems (e.g., in astronomy). We consider the nonparametric maximum likelihood estimator (NPMLE) in this setting. We study the characterization, uniqueness, and computation of the NPMLE which estimates the unknown (arbitrary) prior by solving an infinite-dimensional convex optimization problem. The EB posterior means based on the NPMLE have low regret, meaning they closely target the oracle posterior means one would compute with the true prior in hand. We demonstrate the adaptive and near-optimal properties of the NPMLE for density estimation, denoising and deconvolution. In the second half of the talk, we consider the problem of Bayesian high dimensional regression where the regression coefficients are drawn i.i.d. from an unknown prior. To estimate this prior distribution, we propose and study a "variational empirical Bayes" approach — it combines EB inference with a variational approximation (VA). The idea is to approximate the intractable marginal log-likelihood of the response vector --- also known as the "evidence" --- by the evidence lower bound (ELBO) obtained from a naive mean field (NMF) approximation. We then maximize this lower bound over a suitable class of prior distributions in a computationally feasible way. We show that the marginal log-likelihood function can be (uniformly) approximated by its mean field counterpart. More importantly, under suitable conditions, we establish that this strategy leads to consistent approximation of the true posterior and provides asymptotically valid posterior inference for the regression coefficients.   Bio: Bodhi Sen is a Professor of Statistics at Columbia University, New York. He completed his Ph.D in Statistics from University of Michigan, Ann Arbor, in 2008. Prior to that, he was a student at the Indian Statistical Institute, Kolkata, where he received his Bachelors (2002) and Masters (2004) in Statistics. His core statistical research centers around nonparametrics --- function estimation (with special emphasis on shape constrained estimation), theory of optimal transport and its applications to statistics, empirical Bayes procedures, kernel methods, likelihood and bootstrap based inference, etc. He is also actively involved in interdisciplinary research, especially in astronomy. His honors include the NSF CAREER award (2012), and the Young Statistical Scientist Award (YSSA) in the Theory and Methods category from the International Indian Statistical Association (IISA). He is an elected fellow of the Institute of Mathematical Statistics (IMS).  

The Econometric Society Interdisciplinary Frontiers (ESIF) conference on Economics and AI+ML will be hosted by Cornell University, in Ithaca NY, on August 13-14. The purpose of the meeting is to foster interaction of ideas and methodologies from the areas of Computer Science and Economics (broadly defined, but with emphasis on AI and ML). The conference will feature keynote lectures and parallel sessions, bringing together scholars from both fields.   VISIT THE CONFERENCE WEBSITE SUBMIT PAPERS (By February 25)    

Cornell researchers have developed two technologies that track a person’s gaze and facial expressions through sonar-like sensing. 

April 10, 2024

AI memory aids, post-apocalyptic video games and a stock trading app are among the digital creations that will be on display at Bits On Our Minds, the premier showcase for Cornell student projects in cutting-edge digital technology. 

April 8, 2024

Researchers have developed a wristband device that continuously detects hand positioning – as well as objects the hand interacts with – using AI-powered, inaudible soundwaves.

April 2, 2024