ex-man / GeneralInsurance_Class

This includes a simple training for general insurance business designed for insurance students
4 stars 50 forks source link

L4_Intro: Using past to predict future #36

Open ex-man opened 6 years ago

ex-man commented 6 years ago

In this lesson, the data will be analysed further to find a GLM model, that is consistent over time and represents a good fit for some years of data. Concepts of validation and other diagnostics may be introduced.

ex-man commented 6 years ago
ex-man commented 6 years ago

found this reasonable visual of glm... hope it helps showing what it is... http://www.brainvoyager.com/bvqx/doc/UsersGuide/StatisticalAnalysis/TheGeneralLinearModel.html

cvoky commented 6 years ago

Found little bit strange thing or I do not understand data well.

What is the right level of policy data to ensure correct merging with claims data? I was expecting to have unique observetion per NrPolicy + NrObject.

Found duplicates in identifiers, see below: screen shot 2018-03-18 at 17 54 03

Please explain. For now excluding those duplicates.


Another strange thing to me is why one claim can be linked to two different policy no. ? screen shot 2018-03-18 at 18 15 04

Please explain. For now excluding those duplicates.

ex-man commented 6 years ago

Both can happen. 2nd one if there are 2 policyholders that hit each other in one accident. 1st, if there are more insureds on one policy (but likely an error in data, unless this si after join, in which case there were 2 claims, which is fine)

cvoky commented 6 years ago

Thanks! Make sense. For now I would suggest to not include those cases, so we can more focus to the core of the lesson.

What do you think?

ex-man commented 6 years ago

Sounds good. Feel free to remove.