I have a database with many resumes, including structured data by gender, age, address, number of years of study and many other parameters of each person.
At about 10% of the sample, I also have additional data about a specific action that they did at a particular point in time. For example, Jane took a mortgage in July 1998 or that John began pilot training in January 2007 and received a license in December 2007.
I need an algorithm that gives for each of the actions the likelihood that this will happen for each person in the future. For example, the probability that Bill will take a mortgage will be 2% in 2011, 3.5% in 2012, etc.
How do I approach this? Regression analysis? SVM? Neural network? Something else?
Maybe even some standard tool / library that I can only use with obvious settings?
source share