Integrating Multiple Datasets With Missing Records And Partially Overlapping Feature Sets: Applications To Healthcare Prediction Tasks