On the Role of Prediction in Streaming Hierarchical Learning