initial (again)
Joe Davison committed · a922691
Parent(s):
Browse files:
- app.py +167 -0
- huggingface_logo.png +0 -0
- requirements.txt +85 -0
- style.css +4 -0
- texts.json +21 -0
app.py
ADDED
@@ -0,0 +1,167 @@
import streamlit as st
from transformers import BartForSequenceClassification, BartTokenizer
import torch
import numpy as np
import contextlib
import plotly.express as px
import pandas as pd
from PIL import Image
import datetime

# append a timestamp to a simple hit log on every page load
with open("hit_log.txt", mode='a') as file:
    file.write(str(datetime.datetime.now()) + '\n')

MODEL_DESC = {
    'Bart MNLI': """Bart with a classification head trained on MNLI.\n\nSequences are posed as NLI premises and topic labels are turned into hypotheses, e.g. `business` -> `This text is about business.`""",
    'Bart MNLI + Yahoo Answers': """Bart with a classification head trained on MNLI and then further fine-tuned on Yahoo Answers topic classification.\n\nSequences are posed as NLI premises and topic labels are turned into hypotheses, e.g. `business` -> `This text is about business.`""",
}

ZSL_DESC = """Recently, the NLP science community has begun to pay increasing attention to zero-shot and few-shot applications, such as in the [paper from OpenAI](https://arxiv.org/abs/2005.14165) introducing GPT-3. This demo shows how 🤗 Transformers can be used for zero-shot topic classification, the task of predicting a topic that the model has not been trained on."""

CODE_DESC = """```python
# pose sequence as a NLI premise and label as a hypothesis
from transformers import BartForSequenceClassification, BartTokenizer
nli_model = BartForSequenceClassification.from_pretrained('bart-large-mnli')
tokenizer = BartTokenizer.from_pretrained('bart-large-mnli')

premise = sequence
hypothesis = f'This text is about {label}.'

# run through model pre-trained on MNLI
x = tokenizer.encode(premise, hypothesis, return_tensors='pt',
                     max_length=tokenizer.max_len,
                     truncation_strategy='only_first')
logits = nli_model(x.to(device))[0]

# we throw away "neutral" (dim 1) and take the probability of
# "entailment" (2) as the probability of the label being true
entail_contradiction_logits = logits[:,[0,2]]
probs = entail_contradiction_logits.softmax(1)
prob_label_is_true = probs[:,1]
```"""

model_ids = {
    'Bart MNLI': 'bart-large-mnli',
    'Bart MNLI + Yahoo Answers': './bart_mnli_topics'
}

device = torch.device('cuda') if torch.cuda.is_available() else torch.device('cpu')

@st.cache(allow_output_mutation=True)
def load_models():
    return {id: BartForSequenceClassification.from_pretrained(id).to(device) for id in model_ids.values()}

models = load_models()


@st.cache(allow_output_mutation=True)
def load_tokenizer(tok_id):
    return BartTokenizer.from_pretrained(tok_id)

@st.cache(allow_output_mutation=True, show_spinner=False)
def classify_candidate(nli_model_id, sequence, label, do_print_code):
    nli_model = models[nli_model_id]
    tokenizer = load_tokenizer('bart-large')

    # pose sequence as a NLI premise and label as a hypothesis
    premise = sequence
    hypothesis = f'This text is about {label}.'

    # run through model pre-trained on MNLI
    x = tokenizer.encode(premise, hypothesis, return_tensors='pt',
                         max_length=tokenizer.max_len,
                         truncation_strategy='only_first')
    with torch.no_grad():
        logits = nli_model(x.to(device))[0]

    # we throw away "neutral" (dim 1) and take the probability of
    # "entailment" (2) as the probability of the label being true
    entail_contradiction_logits = logits[:,[0,2]]
    probs = entail_contradiction_logits.softmax(1)
    prob_label_is_true = probs[:,1]

    return prob_label_is_true.cpu()

def get_most_likely(nli_model_id, sequence, labels, do_print_code):
    predictions = []
    for label in labels:
        predictions.append(classify_candidate(nli_model_id, sequence, label, do_print_code))
        do_print_code = False  # only print code once per run
    predictions = torch.cat(predictions)

    most_likely = predictions.argsort().numpy()
    top_topics = np.array(labels)[most_likely]
    scores = predictions[most_likely].detach().numpy()
    return top_topics, scores

@st.cache(allow_output_mutation=True)
def get_sentence_model(model_id):
    # imported lazily so the app still starts if sentence-transformers
    # (not pinned in requirements.txt) is missing; this helper is not called below
    from sentence_transformers import SentenceTransformer
    return SentenceTransformer(model_id)

def load_examples():
    df = pd.read_json('texts.json')
    names = df.name.values.tolist()
    mapping = {df['name'].iloc[i]: (df['text'].iloc[i], df['labels'].iloc[i]) for i in range(len(names))}
    names.append('Custom')
    mapping['Custom'] = ('', '')
    return names, mapping

def plot_result(top_topics, scores):
    scores *= 100
    fig = px.bar(x=scores, y=top_topics, orientation='h',
                 labels={'x': 'Confidence', 'y': 'Label'},
                 text=scores,
                 range_x=(0, 115),
                 title='Top Predictions',
                 color=np.linspace(0, 1, len(scores)),
                 color_continuous_scale='GnBu')
    fig.update(layout_coloraxis_showscale=False)
    fig.update_traces(texttemplate='%{text:0.1f}%', textposition='outside')
    st.plotly_chart(fig)


def main():
    with open("style.css") as f:
        st.markdown('<style>{}</style>'.format(f.read()), unsafe_allow_html=True)

    ex_names, ex_map = load_examples()

    logo = Image.open('huggingface_logo.png')
    st.sidebar.image(logo, width=120)
    st.sidebar.markdown(ZSL_DESC)
    model_desc = st.sidebar.selectbox('Model', list(MODEL_DESC.keys()), 0)
    do_print_code = st.sidebar.checkbox('Show code snippet', False)
    st.sidebar.markdown('#### Model Description')
    st.sidebar.markdown(MODEL_DESC[model_desc])
    st.sidebar.markdown('Originally proposed by [Yin et al. (2019)](https://arxiv.org/abs/1909.00161). Read more in our [blog post](https://joeddav.github.io/blog/2020/05/29/ZSL.html).')

    st.title('Zero Shot Topic Classification')
    example = st.selectbox('Choose an example', ex_names)
    height = min((len(ex_map[example][0].split()) + 1) * 2, 200)
    sequence = st.text_area('Text', ex_map[example][0], key='sequence', height=height)
    labels = st.text_input('Possible topics (comma-separated)', ex_map[example][1], max_chars=1000)

    labels = list(set([x.strip() for x in labels.strip().split(',') if len(x.strip()) > 0]))
    if len(labels) == 0 or len(sequence) == 0:
        st.write('Enter some text and at least one possible topic to see predictions.')
        return

    if do_print_code:
        st.markdown(CODE_DESC)

    model_id = model_ids[model_desc]

    with st.spinner('Classifying...'):
        top_topics, scores = get_most_likely(model_id, sequence, labels, do_print_code)

    plot_result(top_topics[-10:], scores[-10:])


if __name__ == '__main__':
    main()
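For reference, later releases of 🤗 Transformers package this same premise/hypothesis formulation as a built-in `zero-shot-classification` pipeline. A minimal sketch, assuming a modern `transformers` (the 4.x API, newer than the 2.9.1 pinned in requirements.txt below) and the hub model id `facebook/bart-large-mnli`:

```python
# Sketch only: the transformers 4.x zero-shot pipeline wraps the same NLI
# premise/hypothesis trick that app.py implements by hand above.
from transformers import pipeline

classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

result = classifier(
    "Who are you voting for in 2020?",
    candidate_labels=["politics", "business", "space & cosmos"],
    hypothesis_template="This text is about {}.",  # same template as app.py
    multi_label=True,  # score each label independently, as app.py does
)
print(result["labels"], result["scores"])
```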
huggingface_logo.png
ADDED
requirements.txt
ADDED
@@ -0,0 +1,85 @@
altair==4.1.0
appnope==0.1.0
astor==0.8.1
attrs==19.3.0
backcall==0.1.0
base58==2.0.0
bleach==3.1.5
blinker==1.4
boto3==1.13.12
botocore==1.16.12
cachetools==4.1.0
certifi==2020.4.5.1
chardet==3.0.4
click==7.1.2
decorator==4.4.2
defusedxml==0.6.0
docutils==0.15.2
entrypoints==0.3
enum-compat==0.0.3
filelock==3.0.12
future==0.18.2
idna==2.9
importlib-metadata==1.6.0
ipykernel==5.2.1
ipython==7.14.0
ipython-genutils==0.2.0
ipywidgets==7.5.1
jedi==0.17.0
Jinja2==2.11.2
jmespath==0.10.0
joblib==0.15.1
jsonschema==3.2.0
jupyter-client==6.1.3
jupyter-core==4.6.3
MarkupSafe==1.1.1
mistune==0.8.4
nbconvert==5.6.1
nbformat==5.0.6
notebook==6.0.3
numpy==1.18.4
packaging==20.3
pandas==1.0.3
pandocfilters==1.4.2
parso==0.7.0
pathtools==0.1.2
pexpect==4.8.0
pickleshare==0.7.5
Pillow==7.1.2
prometheus-client==0.7.1
prompt-toolkit==3.0.5
protobuf==3.12.0
ptyprocess==0.6.0
pydeck==0.3.1
Pygments==2.6.1
pyparsing==2.4.7
pyrsistent==0.16.0
python-dateutil==2.8.1
pytz==2020.1
pyzmq==19.0.1
regex==2020.5.14
requests==2.23.0
s3transfer==0.3.3
sacremoses==0.0.43
Send2Trash==1.5.0
sentencepiece==0.1.90
six==1.14.0
streamlit==0.60.0
terminado==0.8.3
testpath==0.4.4
tokenizers==0.7.0
toml==0.10.1
toolz==0.10.0
torch==1.5.0
tornado==5.1.1
tqdm==4.46.0
traitlets==4.3.3
transformers==2.9.1
tzlocal==2.1
urllib3==1.25.9
validators==0.15.0
watchdog==0.10.2
wcwidth==0.1.9
webencodings==0.5.1
widgetsnbextension==3.5.1
zipp==3.1.0
style.css
ADDED
@@ -0,0 +1,4 @@
.fullScreenFrame > div {
    display: flex;
    justify-content: center;
}
texts.json
ADDED
@@ -0,0 +1,21 @@
{
    "name": {
        "0": "\"Jupiter's Biggest Moons Started as Tiny Grains of Hail\"",
        "1": "Who are you voting for in 2020?",
        "2": "Attention is all you need",
        "3": "IMDB Avengers Review",
        "4": "Bose QuietComfort"
    },
    "text": {
        "0": "Jupiter\u2019s Biggest Moons Started as Tiny Grains of Hail\n\nA new model offers an explanation for how the Galilean satellites formed around the solar system\u2019s largest world.\n\nKonstantin Batygin did not set out to solve one of the solar system\u2019s most puzzling mysteries when he went for a run up a hill in Nice, France. Dr. Batygin, a Caltech researcher, best known for his contributions to the search for the solar system\u2019s missing \u201cPlanet Nine,\u201d spotted a beer bottle. At a steep, 20 degree grade, he wondered why it wasn\u2019t rolling down the hill.\n\nHe realized there was a breeze at his back holding the bottle in place. Then he had a thought that would only pop into the mind of a theoretical astrophysicist: \u201cOh! This is how Europa formed.\u201d\n\nEuropa is one of Jupiter\u2019s four large Galilean moons. And in a paper published Monday in the Astrophysical Journal, Dr. Batygin and a co-author, Alessandro Morbidelli, a planetary scientist at the C\u00f4te d\u2019Azur Observatory in France, present a theory explaining how some moons form around gas giants like Jupiter and Saturn, suggesting that millimeter-sized grains of hail produced during the solar system\u2019s formation became trapped around these massive worlds, taking shape one at a time into the potentially habitable moons we know today.",
        "1": "Who are you voting for in 2020?",
        "2": "The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.",
        "3": "Are you a fan of epic adventure movies? Then this is your dream come true! Truly this is the ultimate superhero mash-up and it's executed perfectly. Props to the filmmakers for taking the time to design it to be more than just a superhero film packed with action scenes and adding depth to each character.",
        "4": "What happens when you clear away the noisy distractions of the world? Concentration goes to the next level. You get deeper into your music, your work, or whatever you want to focus on. That’s the power of Bose QuietComfort 35 wireless headphones II. Put them on and get closer to what you’re most passionate about. And that’s just the beginning. QuietComfort 35 wireless headphones II are now enabled with Bose AR — an innovative, audio-only take on augmented reality. Embedded inside your headphones is a multi-directional motion sensor. One that Bose AR can utilize to provide contextual audio based on where you are. Unlock Bose AR via a firmware update through the Bose Connect app. They’re Alexa-enabled, too, so you can enjoy entertainment, get information, and manage your day — all without looking at your phone. Adjust your level of noise cancelling between three settings using the Action button or the Bose Connect app. Volume-optimized EQ gives you balanced audio performance at any volume, and a noise-rejecting dual-microphone system provides clearer calls, even in noisy environments. And with easy Bluetooth pairing, 20 hours of battery life, and a durable, comfortable fit — you can keep the music or the quiet going all day long. Included: QuietComfort 35 II, carrying case, charging cable, audio cable for enjoying music without battery power."
    },
    "labels": {
        "0": "space & cosmos, scientific discovery, microbiology, robots, archeology",
        "1": "foreign policy, Europe, elections, business, 2020, outdoor recreation, politics",
        "2": "machine learning, statistics, translation, vision",
        "3": "films, action, superheroes, books",
        "4": "electronics, headphones, health & wellness, furniture, software, pet supplies"
    }
}
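`load_examples` in app.py reads this file with `pd.read_json`, which treats the three top-level keys as columns and the "0" through "4" keys as the row index. A minimal sketch of that parsing, assuming the file above is saved as `texts.json` next to the script:

```python
# Sketch: mirrors load_examples() in app.py; assumes texts.json as above.
import pandas as pd

df = pd.read_json('texts.json')  # columns: name, text, labels; index: "0".."4"
names = df.name.values.tolist()
mapping = {df['name'].iloc[i]: (df['text'].iloc[i], df['labels'].iloc[i])
           for i in range(len(names))}
print(names)              # the five example titles
print(mapping[names[1]])  # -> (text, comma-separated labels) for one example
```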