12  SPLICE

This article was written by Greg Taylor and Sarah MacDonnell and was originally published here on 14 November 2022.

The R data simulation package SynthETIC has been superseded by an updated version, SPLICE (Synthetic Paid Loss and Incurred Cost Experience).

SPLICE, whilst still based on SynthETIC, has now been extended to simulate case estimates, and hence incurred claims. It can be accessed on CRAN, along with other relevant resources including a reference manual.

SPLICE is a useful tool for producing simulated datasets on which to test reserving methods, including machine learning methods. It generates datasets of triangles, as well as individual claims transactions, showing paid and incurred development by occurrence, together with notification and settlement times.
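As a rough illustration of what this looks like in practice, the sketch below simulates one set of claim, payment and incurred data. It assumes the `generate_data()` convenience function and the argument names described in the package's reference manual; the parameter values (50 claims per period, 40 periods, complexity level 1) are arbitrary choices made for this example, so check the manual for the version you have installed before relying on them.

```r
# install.packages("SPLICE")  # one-off install from CRAN
library(SPLICE)

# Simulate claim-level, payment-level and incurred (case estimate) data.
# Argument names follow the reference manual as we understand it;
# the values are illustrative assumptions only.
sim <- generate_data(
  n_claims_per_period = 50,
  n_periods = 40,
  complexity = 1,
  data_type = c("claim", "payment", "incurred"),
  random_seed = 42
)

# Inspect what was returned rather than assuming its layout
names(sim)
str(sim, max.level = 1)
```

Fixing `random_seed` keeps the simulated experience reproducible, which is helpful when comparing several reserving methods on the same data.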

The user can set up specific features in the datasets. For example, in the Al-Mudafer thesis, which we have previously featured, four different types of claims triangles were generated.

The MLR WP used these same datasets in our GIRO 21 workshop Machine Learning Reserving on Triangle Data.

We recommend using SPLICE rather than SynthETIC going forward, as we understand that SPLICE is the package that will continue to be supported and updated.

Our previous article introducing SynthETIC can be found here.