Recent Posts

Slides for my presentation at CMStatistics 2021 are available here. The talk was about generalized additive latent and mixed models, which is further described in this post.

Slides for my presentation at the Nordic-Baltic Biometrics Conference are available here.

The organizers of the European R User Meeting 2020 have put together a really impressive event, with lots of opportunities for interaction and stimulating discussions despite being fully online. I particularly enjoyed the good mix of academic presentations focusing on methodology and more business- and industry-related presentations focusing on the use of R in production. Today I presented the BayesMallows package in a five-minute lightning talk, and the slides (with links) are available here.

The European R User Meeting 2020 has so far been a really great event, with interesting talks and online presentations working smoothly. I presented the metagam package this morning, and the slides are available here.

Tonight I gave a presentation on Rcpp at the Oslo UseR! Group. The slides are here.

It was a nice opportunity to meet some of the many R users in town. Thanks to Deemah for organizing!

Publications

We present generalized additive latent and mixed models (GALAMMs) for analysis of clustered data in which both latent and observed variables depend smoothly on observed covariates. A profile likelihood algorithm is proposed, and we derive asymptotic standard errors of both smooth and parametric terms. The work was motivated by applications in cognitive neuroscience, and we show how GALAMMs can successfully model the complex lifespan trajectory of latent episodic memory, along with a discrepant trajectory of working memory, as well as the effect of latent socioeconomic status on hippocampal development.

We address the problem of estimating how different parts of the brain develop and change throughout the lifespan, and how these trajectories are affected by genetic and environmental factors. Estimating these lifespan trajectories is statistically challenging, since their shapes are typically highly nonlinear. Moreover, although true change can only be quantified by longitudinal examinations, follow-up intervals in neuroimaging studies typically cover less than 10% of the lifespan, so use of cross-sectional information is necessary.

Analyzing data from multiple neuroimaging studies has great potential for increasing statistical power: it enables detection of effects of smaller magnitude than would be possible when analyzing each study separately, and it allows systematic investigation of between-study differences. However, restrictions due to privacy or proprietary data, as well as more practical concerns, can make it hard to share neuroimaging datasets, such that analyzing all data in a common location might be impractical or impossible.

BayesMallows is an R package for analyzing preference data in the form of rankings with the Mallows rank model, and its finite mixture extension, in a Bayesian framework. The model is grounded in the idea that the probability density of an observed ranking decreases exponentially with its distance to the location parameter. It is the first Bayesian implementation that allows a wide choice of distances, and it scales well to a large number of items to be ranked.
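The model can be written compactly as follows (a sketch; the notation is mine, with the α/n scaling that is commonly used in this line of work):

$$
p(r \mid \alpha, \rho) = \frac{1}{Z_n(\alpha)} \exp\left\{ -\frac{\alpha}{n} \, d(r, \rho) \right\},
$$

where $r$ is an observed ranking of $n$ items, $\rho$ is the latent consensus ranking (location parameter), $\alpha > 0$ is a scale parameter, $d(\cdot, \cdot)$ is a right-invariant distance between rankings, and $Z_n(\alpha)$ is the partition function (normalizing constant).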

Researchers interested in hemispheric dominance frequently aim to infer latent functional differences between the hemispheres from observed lateral behavioural or brain-activation differences. To be valid, such inferences cannot rely on the observed laterality measures alone; they must also account for the antecedent probabilities of the studied latent classes. This fact is frequently ignored in the literature, leading to misclassification, especially for low-probability classes such as “atypical” right-hemispheric language dominance.

This is the companion paper to the hdme R package. Link to paper.

Ranking and comparing items is crucial for collecting information about preferences in many areas, from marketing to politics. The Mallows rank model is among the most successful approaches to analyzing rank data, but its computational complexity has limited its use to a particular form based on the Kendall distance. We develop new computationally tractable methods for Bayesian inference in Mallows models that work with any right-invariant distance. Our method performs inference on the consensus ranking of the items, even when only partial rankings, such as top-k rankings or pairwise comparisons, are observed.

In many problems involving generalized linear models, the covariates are subject to measurement error. When the number of covariates p exceeds the sample size n, regularized methods like the lasso or Dantzig selector are required. Several recent papers have studied methods which correct for measurement error in the lasso or Dantzig selector for linear models in the p > n setting. We study a correction for generalized linear models, based on Rosenbaum and Tsybakov’s matrix uncertainty selector.
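For reference, the matrix uncertainty selector can be sketched as the solution of the following program (notation illustrative):

$$
\hat{\beta} \in \arg\min_{\beta} \left\{ \|\beta\|_1 \;:\; \left\| \tfrac{1}{n} W^\top \left( y - W \beta \right) \right\|_\infty \le \lambda + \delta \|\beta\|_1 \right\},
$$

where $W$ is the error-prone design matrix and the extra slack term $\delta \|\beta\|_1$ accounts for the magnitude of the measurement error; setting $\delta = 0$ recovers the ordinary Dantzig selector constraint.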

Regression with the lasso penalty is a popular tool for performing dimension reduction when the number of covariates is large. In many applications of the lasso, like in genomics, covariates are subject to measurement error. We study the impact of measurement error on linear regression with the lasso penalty, both analytically and in simulation experiments. A simple method of correction for measurement error in the lasso is then considered. In the large sample limit, the corrected lasso yields sign consistent covariate selection under conditions very similar to the lasso with perfect measurements, whereas the uncorrected lasso requires much more stringent conditions on the covariance structure of the data.
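One common form of such a correction (a sketch; notation illustrative) replaces the biased Gram matrix $W^\top W / n$ by $W^\top W / n - \Sigma_{uu}$, which amounts to solving

$$
\hat{\beta} = \arg\min_{\beta \,:\, \|\beta\|_1 \le R} \left\{ \frac{1}{2n} \|y - W\beta\|_2^2 - \frac{1}{2} \beta^\top \Sigma_{uu} \beta + \lambda \|\beta\|_1 \right\},
$$

where $W$ contains the error-prone measurements, $\Sigma_{uu}$ is the measurement error covariance, and the additional constraint $\|\beta\|_1 \le R$ is needed because the corrected quadratic form may no longer be convex.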

Software

R package available from CRAN. See also the accompanying Shiny App. Functional differences between the cerebral hemispheres are a fundamental characteristic of the human brain. Researchers interested in studying these differences often infer underlying hemispheric dominance for a certain function (e.g., language) from laterality indices calculated from observed performance or brain activation measures. However, any inference from observed measures to latent (unobserved) classes has to consider the prior probability of class membership in the population.
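The underlying logic can be illustrated with a minimal Bayes' rule computation in base R; all probabilities below are made up for illustration:

```r
# Posterior probability of "atypical" right-hemispheric language dominance
# given a right-lateralized laterality index (LI), via Bayes' rule.
# All numbers are hypothetical, chosen only to illustrate the point.
prior_atypical <- 0.05        # assumed antecedent probability of the atypical class
p_obs_given_atypical <- 0.60  # assumed P(right-lateralized LI | atypical dominance)
p_obs_given_typical <- 0.10   # assumed P(right-lateralized LI | typical dominance)

posterior_atypical <- prior_atypical * p_obs_given_atypical /
  (prior_atypical * p_obs_given_atypical +
     (1 - prior_atypical) * p_obs_given_typical)

posterior_atypical  # 0.24: typical dominance remains more likely despite the LI
```

With a low-probability class, even a fairly diagnostic observed measure leaves the posterior below one half, which is exactly why ignoring the prior leads to misclassification.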

R package available from CRAN. Meta-analysis of generalized additive models and generalized additive mixed models. A typical use case is when data cannot be shared across locations, and an overall meta-analytic fit is sought. ‘metagam’ provides functionality for removing individual participant data from models computed using the ‘mgcv’ and ‘gamm4’ packages such that the model objects can be shared without exposing individual data. Furthermore, methods for meta-analyzing these fits are provided.
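A hedged sketch of the workflow is below; the function names strip_rawdata() and metagam() are from the package, but the simulated cohorts and argument choices are illustrative:

```r
library(mgcv)
library(metagam)

set.seed(1)
# Hypothetical helper simulating one cohort's data (illustration only)
simulate_cohort <- function(n = 100) {
  age <- runif(n, 0, 80)
  data.frame(age = age, y = sin(age / 20) + rnorm(n, sd = 0.3))
}

# Each data location fits its own GAM, then strips individual participant
# data so the model object can be shared without exposing raw data
fits <- lapply(list(simulate_cohort(), simulate_cohort()),
               function(d) strip_rawdata(gam(y ~ s(age), data = d)))

# The analysis location meta-analyzes the shared, stripped fits
meta_fit <- metagam(fits, grid_size = 100)
summary(meta_fit)
```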

R package available from CRAN. An implementation of the Bayesian version of the Mallows rank model. The Cayley, footrule, Hamming, Kendall, Spearman, and Ulam distances are supported. The rank data to be analyzed can be in the form of complete rankings, top-k rankings, partially missing rankings, as well as consistent and inconsistent pairwise preferences. Several functions for plotting and studying the posterior distributions of parameters are provided. The package also provides functions for estimating the partition function (normalizing constant) of the Mallows rank model, including the importance sampling algorithm of Vitelli et al.
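An illustrative sketch is below; the interface shown follows earlier versions of the package (compute_mallows() taking rankings directly) and may differ in newer releases:

```r
library(BayesMallows)

# potato_visual: complete rankings of 20 items, shipped with the package
fit <- compute_mallows(rankings = potato_visual,
                       metric = "footrule", nmc = 5000)
fit$burnin <- 1000  # discard burn-in before summarizing

plot(fit, parameter = "alpha")                      # posterior of the scale parameter
compute_posterior_intervals(fit, parameter = "rho") # intervals for the consensus ranking
```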

R package available from CRAN. Penalized regression for generalized linear models with measurement error (also known as errors-in-variables). The package contains a version of the lasso (L1-penalization) that corrects for measurement error, as well as an implementation of the Generalized Matrix Uncertainty Selector, a version of the (Generalized) Dantzig Selector for the case of measurement error.
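A hedged usage sketch with simulated data is below; the function names corrected_lasso() and gmus() are from the package, but the data and argument choices are illustrative:

```r
library(hdme)

set.seed(1)
n <- 100; p <- 50
X <- matrix(rnorm(n * p), n, p)                      # true (unobserved) covariates
sigmaUU <- diag(0.2, p)                              # measurement error covariance, assumed known
W <- X + matrix(rnorm(n * p, sd = sqrt(0.2)), n, p)  # observed, error-prone covariates
y <- drop(X[, 1:5] %*% rep(2, 5) + rnorm(n))         # response depends on first 5 covariates

# Lasso corrected for measurement error
fit_lasso <- corrected_lasso(W, y, sigmaUU, family = "gaussian")
plot(fit_lasso)

# Generalized Matrix Uncertainty Selector (Gaussian response here)
fit_gmus <- gmus(W, y, family = "gaussian")
plot(fit_gmus)
```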