nltk PyPDF2 pandas matplotlib seaborn spacy transformers torch scikit-learn