Thiyanga S. Talagala

PhD in Statistics

Monash University, Australia

I am a Senior Lecturer in the Department of Statistics, Faculty of Applied Sciences at the University of Sri Jayewardenepura. I received my PhD in statistics from Monash University, Australia in 2019. My thesis advisors were Professor Rob J Hyndman and Professor George Athanasopoulos.

I was an Associate Investigator of the Australian Research Council (ARC) Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS).

I am a co-founder and co-organizer of R-Ladies Colombo, Sri Lanka, a local chapter of R-Ladies Global, an organization that promotes diversity in the R community worldwide. I am also the coordinator of the statistical consulting service at the University of Sri Jayewardenepura.

I enjoy solving general data science problems from three different angles: theoretical, computational, and applied. On this website you will find some of my work and interests in statistics and data analysis. My research focuses on developing new statistical machine learning tools to help both practitioners and theoreticians make more open, explainable and reproducible data-driven discoveries. I am also interested in the R and Python programming languages.

Priyanga Dilini Talagala (PhD in Statistics, Monash University, Australia) is my sister.


  • Time Series Analysis
  • Data Visualization
  • Computational Statistics
  • Machine Learning
  • Machine Learning Interpretability
  • Applied Statistics
  • Algorithm Selection


  • PhD in Statistics, 2019

    Monash University, Australia

  • MSc in Financial Mathematics, 2015

    University of Moratuwa, Sri Lanka

  • BSc (Hons) Special Degree in Statistics, 2011

    University of Sri Jayewardenepura, Sri Lanka

  • Professor R A Dayananda Gold Medalist and Batch first, 2011

Data Import

tea: R package for tea exporting countries

mozzie: R package for dengue cases in Sri Lanka

colmozzie: R package for dengue cases and climate variables in Colombo, Sri Lanka

m4comp2018: R package for M4 Competition time series data

DSjobtracker: R package containing information related to data science job advertisements. What skills and qualifications are required for data science related jobs?

MedLEA: R package providing morphological and structural features of 471 medicinal plant leaves, along with 1099 leaf images of 31 species (29-45 images per species).

ceylon: An R package to plot maps of Sri Lanka

covid19srilanka: An R package to get a tidy-format dataset of the 2019 Novel Coronavirus COVID-19 (2019-nCoV) epidemic in Sri Lanka.



Small Bite-Big threat = Small Data-Big Impact

Potential impacts of climate change on dengue fever

Introduction to Python

Python Instructional Resources

Data of the 2019 Novel Coronavirus COVID-19 (2019-nCoV) epidemic in Sri Lanka

A tidy format dataset of the 2019 Novel Coronavirus COVID-19 (2019-nCoV) epidemic in Sri Lanka

Statistical Machine Learning for Medicinal Plant Identification

MEDIPI, a statistical machine learning algorithm for medicinal plant identification and a leaf image database for plant classification.

Teaching Statistics

Data sets for teaching data analysis.

R-Ladies Colombo

R-Ladies is a worldwide organization whose mission is to promote diversity in the R community.

Large-Scale Time Series Forecasting

Computationally efficient forecasting methods for large-scale real-time applications

Programming and Data Analysis with R

The course website for my teaching unit STA 326 2.0 Programming and Data Analysis with R


RMarkdown: Create insightful reports in R

Why R? 2021: Keynote

The Why R? Foundation held its Why R? 2021 Conference - the fifth meeting of Central-Eastern-European in December 2021. I was honored …

Forecasting Model Territories

Abstract: The field of time series forecasting has been evolving rapidly with advances in techniques for modelling and forecasting. …

Large-Scale Time Series Forecasting

A Tool to Detect Potential Data Leaks in Forecasting Competitions

Abstract: Forecasting competitions are of increasing importance as a …


My Posit::conf 2023 Experience - A Conference of Insights, Knowledge, Connections, and Explorations

Posit::conf 2023 (formerly known as the RStudio conference) was held at the Hyatt Regency in Chicago, Illinois, USA from 16 to 20 …

Some useful functions to wrangle with time series data

Time series data wrangling

Building a website using Quarto

In this post, you will learn how to build a website using Quarto.

Error: C stack usage is too close to the limit

Delete .Rprofile

Logistic Regression: Model Building and Interpretation

Logistic regression is a widely used modelling approach; however, little is known about the modelling processes and interpretation of …


  • ttalagala@sjp.ac.lk