Upload 12 files

Browse files

Files changed (12) hide show

1_Pooling/config.json +7 -0
README.md +128 -3
config.json +25 -0
config_sentence_transformers.json +7 -0
eval/binary_classification_evaluation_results.csv +16 -0
modules.json +14 -0
pytorch_model.bin +3 -0
sentence_bert_config.json +4 -0
special_tokens_map.json +7 -0
tokenizer.json +0 -0
tokenizer_config.json +16 -0
vocab.txt +0 -0

1_Pooling/config.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "word_embedding_dimension": 1024,
+  "pooling_mode_cls_token": false,
+  "pooling_mode_mean_tokens": true,
+  "pooling_mode_max_tokens": false,
+  "pooling_mode_mean_sqrt_len_tokens": false
+}

README.md CHANGED Viewed

@@ -1,3 +1,128 @@
----
-license: mit
----

+---
+pipeline_tag: sentence-similarity
+tags:
+- sentence-transformers
+- feature-extraction
+- sentence-similarity
+- transformers
+license: mit
+base_model:
+- deutsche-telekom/gbert-large-paraphrase-cosine
+---
+# protestclaims_gbert
+This is a fine-tuned [sentence-transformers](https://www.SBERT.net) model based on [deutsche-telekom/gbert-large-paraphrase-cosine]: It maps sentences & paragraphs to a 1024 dimensional dense vector space and can be used for tasks like clustering or semantic search.
+THe model was fine-tuned using manually annotated sentences from German newspapers containing information about protests and the protesters’ claims.
+## Usage (Sentence-Transformers)
+Using this model becomes easy when you have [sentence-transformers](https://www.SBERT.net) installed:
+```
+pip install -U sentence-transformers
+```
+Then you can use the model like this:
+```python
+from sentence_transformers import SentenceTransformer
+sentences = ["This is an example sentence", "Each sentence is converted"]
+model = SentenceTransformer('shaunss/protestclaims_gbert')
+embeddings = model.encode(sentences)
+print(embeddings)
+```
+## Usage (HuggingFace Transformers)
+Without [sentence-transformers](https://www.SBERT.net), you can use the model like this: First, you pass your input through the transformer model, then you have to apply the right pooling-operation on-top of the contextualized word embeddings.
+```python
+from transformers import AutoTokenizer, AutoModel
+import torch
+#Mean Pooling - Take attention mask into account for correct averaging
+def mean_pooling(model_output, attention_mask):
+    token_embeddings = model_output[0] #First element of model_output contains all token embeddings
+    input_mask_expanded = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
+    return torch.sum(token_embeddings * input_mask_expanded, 1) / torch.clamp(input_mask_expanded.sum(1), min=1e-9)
+# Sentences we want sentence embeddings for
+sentences = ['This is an example sentence', 'Each sentence is converted']
+# Load model from HuggingFace Hub
+tokenizer = AutoTokenizer.from_pretrained('shaunss/protestclaims_gbert')
+model = AutoModel.from_pretrained('shaunss/protestclaims_gbert')
+# Tokenize sentences
+encoded_input = tokenizer(sentences, padding=True, truncation=True, return_tensors='pt')
+# Compute token embeddings
+with torch.no_grad():
+    model_output = model(**encoded_input)
+# Perform pooling. In this case, mean pooling.
+sentence_embeddings = mean_pooling(model_output, encoded_input['attention_mask'])
+print("Sentence embeddings:")
+print(sentence_embeddings)
+```
+<!---
+## Evaluation Results
+ Describe how your model was evaluated
+For an automated evaluation of this model, see the *Sentence Embeddings Benchmark*: [https://seb.sbert.net](https://seb.sbert.net?model_name=protestclaims_gbert)
+-->
+## Training
+The model was trained with the parameters:
+**DataLoader**:
+`torch.utils.data.dataloader.DataLoader` of length 103 with parameters:
+```
+{'batch_size': 32, 'sampler': 'torch.utils.data.sampler.RandomSampler', 'batch_sampler': 'torch.utils.data.sampler.BatchSampler'}
+```
+**Loss**:
+`sentence_transformers.losses.BatchSemiHardTripletLoss.BatchSemiHardTripletLoss`
+Parameters of the fit()-Method:
+```
+{
+    "epochs": 15,
+    "evaluation_steps": 329.20000000000005,
+    "evaluator": "sentence_transformers.evaluation.BinaryClassificationEvaluator.BinaryClassificationEvaluator",
+    "max_grad_norm": 1,
+    "optimizer_class": "<class 'torch.optim.adamw.AdamW'>",
+    "optimizer_params": {
+        "lr": 2e-05
+    },
+    "scheduler": "WarmupLinear",
+    "steps_per_epoch": null,
+    "warmup_steps": 329.20000000000005,
+    "weight_decay": 0.01
+}
+```
+## Full Model Architecture
+```
+SentenceTransformer(
+  (0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
+  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False})
+)
+```
+## Citing & Authors
+<!--- Describe where people can find more information -->

config.json ADDED Viewed

	@@ -0,0 +1,25 @@

+{
+  "_name_or_path": "/home/shaunss/.cache/torch/sentence_transformers/deutsche-telekom_gbert-large-paraphrase-cosine/",
+  "architectures": [
+    "BertModel"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "classifier_dropout": null,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 1024,
+  "initializer_range": 0.02,
+  "intermediate_size": 4096,
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "bert",
+  "num_attention_heads": 16,
+  "num_hidden_layers": 24,
+  "pad_token_id": 0,
+  "position_embedding_type": "absolute",
+  "torch_dtype": "float32",
+  "transformers_version": "4.29.2",
+  "type_vocab_size": 2,
+  "use_cache": true,
+  "vocab_size": 31102
+}

config_sentence_transformers.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "__version__": {
+    "sentence_transformers": "2.2.2",
+    "transformers": "4.22.2",
+    "pytorch": "1.12.1+cu102"
+  }
+}

eval/binary_classification_evaluation_results.csv ADDED Viewed

	@@ -0,0 +1,16 @@

+epoch,steps,cossim_accuracy,cossim_accuracy_threshold,cossim_f1,cossim_precision,cossim_recall,cossim_f1_threshold,cossim_ap,manhattan_accuracy,manhattan_accuracy_threshold,manhattan_f1,manhattan_precision,manhattan_recall,manhattan_f1_threshold,manhattan_ap,euclidean_accuracy,euclidean_accuracy_threshold,euclidean_f1,euclidean_precision,euclidean_recall,euclidean_f1_threshold,euclidean_ap,dot_accuracy,dot_accuracy_threshold,dot_f1,dot_precision,dot_recall,dot_f1_threshold,dot_ap
+0,-1,0.921067053513862,0.6174301505088806,0.658887858223893,0.6309687071437476,0.6893921334922527,0.5224368572235107,0.6568646746104293,0.9210307865892973,512.619384765625,0.6595432434660731,0.6393542199488491,0.6810488676996425,576.3082275390625,0.6604771059232328,0.9209743713733075,20.297161102294922,0.6589871484508278,0.6301267710663684,0.690618082751575,22.839641571044922,0.6603758634801408,0.921248388136686,332.421630859375,0.6571105243304414,0.6337952417109592,0.6822067086667801,287.6917419433594,0.6449042740241154
+1,-1,0.930705996131528,0.6919203996658325,0.708137328379391,0.6658669705511906,0.7561382598331347,0.5270371437072754,0.7445783186925214,0.9303836234687298,504.5845642089844,0.7081255311983836,0.6691822755652543,0.7518814915715989,583.332275390625,0.7470769619207115,0.9304198903932946,18.130956649780273,0.7079736884325364,0.6693159411497043,0.7513706793802145,22.86447525024414,0.7467618178299762,0.9311895551257253,392.57977294921875,0.7085049210563975,0.6645192078262945,0.7587263749361485,294.33001708984375,0.7292517790140433
+2,-1,0.9343931334622824,0.7136921882629395,0.7173332443079389,0.703499983629637,0.7317214370849651,0.6156739592552185,0.7600254060951601,0.9345462604771115,468.803955078125,0.7178995203231507,0.7097171381031614,0.7262727737101993,525.3678588867188,0.7615692068716018,0.9343931334622824,18.204402923583984,0.7179213501884527,0.7034676602281269,0.7329814404903797,20.98118782043457,0.7611671233527293,0.9345059638942618,448.7745666503906,0.7164497475294549,0.7015207884602833,0.7320279243997957,349.7630310058594,0.7531021507097297
+3,-1,0.9334824306898775,0.8246881365776062,0.7137666532095276,0.7052488116211814,0.7224927634939554,0.6401357650756836,0.7553650312900878,0.9333333333333333,356.225830078125,0.7140536829590931,0.7023982079325339,0.7261025029797378,526.1380615234375,0.7556749999748844,0.9334260154738878,14.090566635131836,0.7143887184130656,0.7036865750528541,0.725421420057892,20.60045051574707,0.7557456456436582,0.9337685364281109,486.42901611328125,0.7126337187989469,0.6977062873176939,0.7282138600374596,365.5035400390625,0.7538615733147203
+4,-1,0.9338531592520954,0.814205527305603,0.7100576835421153,0.6593496409597758,0.7692150519325728,0.49904677271842957,0.7792947209946086,0.9338128626692457,386.36199951171875,0.7111637991545651,0.6561132857471794,0.7762983143197685,618.905517578125,0.7805806775525649,0.9338813668600903,14.816593170166016,0.7098461102272906,0.6603657464392474,0.767342073897497,24.055896759033203,0.7795874128291687,0.9339458413926499,478.12628173828125,0.7105407997987137,0.6600829778530941,0.7693512685169419,290.177490234375,0.7739095891952419
+5,-1,0.9366618310767247,0.8182272911071777,0.7154977541614067,0.6287763473594592,0.8299676485612123,0.3864666521549225,0.7800573554713007,0.936621534493875,374.20831298828125,0.7172483863256037,0.6389798471900541,0.8173676145070662,666.6046142578125,0.7809913763265229,0.9366376531270149,14.639816284179688,0.7152236229695158,0.6532473747923764,0.7901924059254214,25.243518829345703,0.7809028159987476,0.9366416827852998,482.78955078125,0.7157549298250754,0.6280821741701591,0.8318746807423804,223.10125732421875,0.7710827491508985
+6,-1,0.9335469052224371,0.7220498323440552,0.7076933429787521,0.6993283681340604,0.7162608547590669,0.5646674633026123,0.7638088155688725,0.9334461637653128,464.3895263671875,0.7083065688495216,0.7032188769173967,0.7134684147794994,572.7876586914062,0.7641790313814547,0.9335106382978723,18.105064392089844,0.7073019698415088,0.695418942934246,0.719598161076111,22.878162384033203,0.7642659600345677,0.9339700193423598,429.23663330078125,0.7077593929048287,0.7025702972954835,0.7130257108802996,336.3193054199219,0.7590580728193954
+7,-1,0.9380641521598968,0.7339451313018799,0.7145889147907172,0.7304477718106483,0.699404052443385,0.5424098968505859,0.7802089257311332,0.9379795293359123,445.5428161621094,0.7142410341951627,0.729165927555256,0.6999148646347693,586.2467651367188,0.7812387549609684,0.9380399742101869,17.663646697998047,0.7144247480013903,0.7295119787045253,0.6999489187808615,23.065956115722656,0.780661304599755,0.9380117666021921,441.88720703125,0.7143132276985029,0.7350990990990991,0.6946705261365571,321.2008056640625,0.77632069370724
+8,-1,0.9389547066408769,0.7134076356887817,0.7362523066599563,0.7255414118036039,0.7472841818491401,0.5718396902084351,0.7837278579155966,0.938930528691167,478.111572265625,0.7356171941616092,0.7225579714905078,0.7491571598842159,574.936767578125,0.7841746458023153,0.9389506769825918,18.565975189208984,0.7355403666505123,0.7221565925050863,0.7494295930529542,22.627147674560547,0.7842885252023064,0.9388620245003224,431.0670166015625,0.7367326051576315,0.7261211800502712,0.7476587774561553,337.70526123046875,0.7731685327224174
+9,-1,0.9398331721470019,0.6916142702102661,0.7328376574983968,0.7461706783369803,0.7199727566831262,0.5889520049095154,0.772660656556971,0.9397082527401677,486.5942077636719,0.7326520748463715,0.7428851471978156,0.722697088370509,564.412353515625,0.7733164125359162,0.9396760154738878,18.512386322021484,0.73282733103961,0.7442930392638899,0.7217095181338328,22.103715896606445,0.7732477636608306,0.9398936170212766,398.99700927734375,0.7331376320140365,0.7264702932227758,0.7399284862932062,322.1018981933594,0.7649336343573698
+10,-1,0.9407519342359768,0.7781369686126709,0.739671546610896,0.7447271220960336,0.734684147794994,0.598785400390625,0.7880129413809182,0.9405988072211476,415.1871337890625,0.7390562043297139,0.7410552264868011,0.7370679380214541,560.383056640625,0.7891372156012788,0.9407599935525467,16.461021423339844,0.7394946513572076,0.7409737417943107,0.7380214541120381,22.078475952148438,0.7886224309074015,0.9407519342359768,452.0459899902344,0.7404528732058655,0.7512617412028599,0.7299506214881661,361.9554443359375,0.7780874244480435
+11,-1,0.9406108961960026,0.696799635887146,0.7308022381655281,0.7367200747482437,0.7249787161586924,0.556389570236206,0.7790594718334573,0.9403973243068988,480.479736328125,0.7296637039347159,0.742698927765237,0.7170781542652818,568.3990478515625,0.7805541007996804,0.9403892649903288,19.026138305664062,0.729687257153712,0.7402074136360451,0.7194619444917418,22.555213928222656,0.7795796849932237,0.9408204384268214,417.43902587890625,0.7324324324324324,0.7358308987798591,0.7290652136897667,323.7745666503906,0.769907397964928
+12,-1,0.9395027401676338,0.6551991701126099,0.7275134579536942,0.7447567413325724,0.7110505704069471,0.553314745426178,0.7755738324174184,0.9394906511927789,512.1928100585938,0.7277290277290278,0.7507876154263987,0.7060446109313809,576.0177001953125,0.7766925710814888,0.9393818504190845,20.226606369018555,0.7269002468876398,0.748107835363656,0.7068619104375958,22.730484008789062,0.7763753765070203,0.9395913926499033,389.4950256347656,0.7286559299261921,0.7471552279395978,0.7110505704069471,327.77191162109375,0.7644954507599309
+13,-1,0.9389144100580271,0.6844364404678345,0.728031253866959,0.7569475077194531,0.7012429763323684,0.595650315284729,0.7730647722642658,0.9388056092843327,488.16650390625,0.7276908110020341,0.757085020242915,0.7004937851183382,554.832763671875,0.7742282510583335,0.9388579948420374,19.26504135131836,0.7278375484844761,0.7509416195856874,0.7061127192235654,22.17256736755371,0.7740326348416876,0.9389627659574468,408.8873291015625,0.7288357912648895,0.7600088721304203,0.700119189511323,354.1147155761719,0.7617965621320361
+14,-1,0.9385315925209542,0.7158571481704712,0.7225264954833204,0.7560567154181087,0.6918440320108973,0.5912114381790161,0.7719068643122562,0.9385315925209542,464.43426513671875,0.7219023101415893,0.718083493379157,0.7257619615188149,608.8768920898438,0.7729928601549992,0.9385557704706641,18.424461364746094,0.721666369948371,0.7561275881365417,0.6902094329984676,21.92572784423828,0.7728280322067003,0.9384509993552547,422.6900634765625,0.7239607987517314,0.7460793524607935,0.7031159543674442,337.285888671875,0.7613874415494246

modules.json ADDED Viewed

	@@ -0,0 +1,14 @@

+[
+  {
+    "idx": 0,
+    "name": "0",
+    "path": "",
+    "type": "sentence_transformers.models.Transformer"
+  },
+  {
+    "idx": 1,
+    "name": "1",
+    "path": "1_Pooling",
+    "type": "sentence_transformers.models.Pooling"
+  }
+]

pytorch_model.bin ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:22dceb36ad828d00b61e7a21e01042cfc77128931cdf00185be70fa7108982ee
+size 1343079533

sentence_bert_config.json ADDED Viewed

	@@ -0,0 +1,4 @@

+{
+  "max_seq_length": 512,
+  "do_lower_case": false
+}

special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_config.json ADDED Viewed

	@@ -0,0 +1,16 @@

+{
+  "clean_up_tokenization_spaces": true,
+  "cls_token": "[CLS]",
+  "do_basic_tokenize": true,
+  "do_lower_case": false,
+  "mask_token": "[MASK]",
+  "max_len": 512,
+  "model_max_length": 512,
+  "never_split": null,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": false,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "BertTokenizer",
+  "unk_token": "[UNK]"
+}

vocab.txt ADDED Viewed

The diff for this file is too large to render. See raw diff