An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels