# Add new SentenceTransformer model

Changed files:

- README.md (+337 / -130)
- config.json (+2 / -2)
- config_sentence_transformers.json (+1 / -1)
- model.safetensors (+1 / -1)
- sentence_bert_config.json (+1 / -1)
- tokenizer.json (+1 / -1)
- tokenizer_config.json (+1 / -1)

## README.md (changed)
@@ -7,109 +7,113 @@ tags:
- feature-extraction
- dense
- generated_from_trainer
- dataset_size:6331245
- loss:AnglELoss
- loss:CoSENTLoss
- loss:CachedMultipleNegativesRankingLoss
base_model: jhu-clsp/ettin-encoder-32m
widget:
- source_sentence: what is paediatric clinical psychology
  sentences:
  - Pediatric neuropsychology (paediatric in the UK) is a sub-speciality within the
    field of clinical neuropsychology that studies the relationship between brain
    health and behaviour in children. Many pediatric neuropsychologists are involved
    in teaching, research, supervision, and training of undergraduate and graduate
    students in the field. In the United States undergraduate and graduate psychology
    programs generally do not offer a track in pediatric neuropsychology, per se.
  - '"Real" hummus should contain about 175 calories, out of which 70-80 calories
    are contributed by fat. The average Israeli eats 8-10 kilograms (18-22 pounds)
    of hummus every year, so we''re talking about extra 15,000 calories which can
    make him gain about 2.5kg of body weight each year. So you can see how excessive
    consumption of the packaged product might be fattening over the years. The common
    serving size of hummus (real hummus, that is), which is around one cup (220-240g)
    may contain 400-450 calories. And every pita ("pita bread") contains another
    270, so it''s not really "dietary".'
  - 'Pediatrics (also spelled paediatrics or pædiatrics) is the branch of medicine
    that involves the medical care of infants, children, and adolescents. The American
    Academy of Pediatrics recommends people be under pediatric care up to the age
    of 21. A medical practitioner who specializes in this area is known as a pediatrician,
    or paediatrician. The word pediatrics and its cognates mean healer of children;
    they derive from two Greek words: παῖς (pais, child) and ἰατρός (iatros, doctor,
    healer).'
- source_sentence: These ancient rites are rarely performed in contemporary Sri Lanka,
    but the conserved songs are still performed by folk musicians.
  sentences:
  - In 1971, a main campus was completed in 33 MacDonnell Road for the new school.
  - These ancient rites are still performed in contemporary Sri Lanka, but the preserved
    songs are rarely performed by folk musicians.
  - After May 4, 2012, Gordon M. Snow was replaced by Joseph M. Demarest and then
    Michael S. Welch with limited formal announcement.
- source_sentence: A woman is playing the flute.
  sentences:
  - A boy is playing the trumpet.
  - A man tries to read the paper.
  - A man is playing the guitar.
- source_sentence: Interference now on all our scans.
  sentences:
  - Would you permit me to explain this Polly?
  - All Ourscans are jammed.
  - The aircraft family was first introduced at the Paris Air Show in 1999.
- source_sentence: why has chs invested in da?
  sentences:
  - In order to renew the strategic road map to CHS's growth, CHS partnered with
    DA to improve outcomes rather than increasing its size. Most of DA's capacity
    was used to provide tools in order to support CHS-affiliated hospitals in delivering
    best-in-class healthcare to patients.
  - You can in theory add every enchantment that is compatible with a tool/weapon/armor
    onto the same item. The bow can have these 7 enchantments, though mending and
    infinity are mutually exclusive. So you can have up to 6 different enchantments
    on a bow using an anvil.
  - 'Clean up is a phrasal verb which means: to make (a room or space) clean and
    orderly. ... Clean out is a phrasal verb which means something such as a cupboard,
    room, or container, you take everything out of it and clean the inside of it
    thoroughly. Secondly, "clean" is a simple word which is often used in our daily
    life.'
datasets:
- google-research-datasets/paws
- nyu-mll/glue
- mwong/fever-evidence-related
- tasksource/parade
- tasksource/apt
- tasksource/sts-companion
- tasksource/zero-shot-label-nli
pipeline_tag: sentence-similarity
library_name: sentence-transformers
---

# SentenceTransformer based on jhu-clsp/ettin-encoder-32m

This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [jhu-clsp/ettin-encoder-32m](https://huggingface.co/jhu-clsp/ettin-encoder-32m) on 19 datasets. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

## Model Details

### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [jhu-clsp/ettin-encoder-32m](https://huggingface.co/jhu-clsp/ettin-encoder-32m) <!-- at revision 1b8ba06455dd44f80fc9c1ca9e22806157a57379 -->
- **Maximum Sequence Length:** 1024 tokens
- **Output Dimensionality:** 384 dimensions
- **Similarity Function:** Cosine Similarity
- **Training Datasets:**
    - [paws/labeled_final](https://huggingface.co/datasets/paws)
    - [glue/mrpc](https://huggingface.co/datasets/glue)
    - [fever-evidence-related](https://huggingface.co/datasets/mwong/fever-evidence-related)
    - [parade](https://huggingface.co/datasets/tasksource/parade)
    - [apt](https://huggingface.co/datasets/tasksource/apt)
    - [glue/stsb](https://huggingface.co/datasets/glue)
    - sick/relatedness
    - [sts-companion](https://huggingface.co/datasets/tasksource/sts-companion)
    - [zero-shot-label-nli](https://huggingface.co/datasets/tasksource/zero-shot-label-nli)
    - tomaarsen/natural-questions-hard-negatives
    - tomaarsen/gooaq-hard-negatives
    - bclavie/msmarco-500k-triplets
    - sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1
    - sentence-transformers/gooaq
    - sentence-transformers/natural-questions
    - sentence-transformers/quora-duplicates
    - sentence-transformers/s2orc
    - sentence-transformers/codesearchnet
    - sentence-transformers/stackexchange-duplicates
- **Language:** en
<!-- - **License:** Unknown -->

@@ -123,7 +127,7 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [j

```
SentenceTransformer(
  (0): Transformer({'max_seq_length': 1024, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
  (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
```
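
As a reading aid (not part of the generated card), here is a minimal sketch of what the `Pooling` and `Normalize` modules above compute, assuming a token-embedding tensor and attention mask produced by the `Transformer` module:

```python
import torch
import torch.nn.functional as F

def mean_pool_and_normalize(token_embeddings: torch.Tensor, attention_mask: torch.Tensor) -> torch.Tensor:
    """token_embeddings: (batch, seq_len, 384); attention_mask: (batch, seq_len) of 0/1."""
    mask = attention_mask.unsqueeze(-1).float()     # (batch, seq_len, 1)
    summed = (token_embeddings * mask).sum(dim=1)   # sum only over real (non-padding) tokens
    counts = mask.sum(dim=1).clamp(min=1e-9)        # number of real tokens per sentence
    mean_pooled = summed / counts                   # 'pooling_mode_mean_tokens': True
    return F.normalize(mean_pooled, p=2, dim=1)     # Normalize(): unit-length embeddings
```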

@@ -147,12 +151,12 @@ from sentence_transformers import SentenceTransformer

```python
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("tasksource/ettin-32m-embed")
# Run inference
queries = [
    "why has chs invested in da?",
]
documents = [
    "In order to renew the strategic road map to CHS's growth, CHS partnered with DA to improve outcomes rather than increasing its size. Most of DA's capacity was used to provide tools in order to support CHS-affiliated hospitals in delivering best-in-class healthcare to patients.",
    'You can in theory add every enchantment that is compatible with a tool/weapon/armor onto the same item. The bow can have these 7 enchantments, though mending and infinity are mutually exclusive. So you can have up to 6 different enchantments on a bow using an anvil.',
    'Clean up is a phrasal verb which means: to make (a room or space) clean and orderly. ... Clean out is a phrasal verb which means something such as a cupboard, room, or container, you take everything out of it and clean the inside of it thoroughly. Secondly, "clean" is a simple word which is often used in our daily life.',
]
query_embeddings = model.encode_query(queries)
document_embeddings = model.encode_document(documents)
print(query_embeddings.shape, document_embeddings.shape)

# Get the similarity scores for the embeddings
similarities = model.similarity(query_embeddings, document_embeddings)
print(similarities)
# tensor([[ 0.6237, -0.0022, -0.1018]])
```
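
Because the final `Normalize()` module makes every embedding unit-length, cosine similarity reduces to a dot product. A small follow-up sketch reusing the variables from the snippet above (the scores come from the card's own output; the retrieval step is illustrative):

```python
import numpy as np

# encode_query / encode_document return NumPy arrays by default, so the
# cosine scores can also be computed as a plain matrix product:
scores = query_embeddings @ document_embeddings.T  # same values as model.similarity(...)
best = int(np.argmax(scores[0]))                   # index 0, the CHS/DA passage
print(best, documents[best][:60])
```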

@@ -264,10 +268,10 @@ You can finetune this model on your own dataset.

* Size: 403,218 training samples
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
* Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | int |
  | details | <ul><li>min: 6 tokens</li><li>mean: 13.92 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>min: 33 tokens</li><li>mean: 316.81 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>0: ~29.20%</li><li>1: ~70.80%</li></ul> |
* Samples:
  | sentence1 | sentence2 | label |
  |:----------|:----------|:------|

@@ -282,6 +286,58 @@ You can finetune this model on your own dataset.

  }
  ```
</details>
<details><summary>parade</summary>

#### parade

* Dataset: [parade](https://huggingface.co/datasets/tasksource/parade) at [466978f](https://huggingface.co/datasets/tasksource/parade/tree/466978f31aebf4d052287f32ea3ae393f178f386)
* Size: 7,550 training samples
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
* Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | int |
  | details | <ul><li>min: 6 tokens</li><li>mean: 21.97 tokens</li><li>max: 61 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 21.81 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>0: ~57.10%</li><li>1: ~42.90%</li></ul> |
* Samples:
  | sentence1 | sentence2 | label |
  |:----------|:----------|:------|
  | <code>predictive models are involved with predicting a value based on other values in the dataset. the process of training a predictive model is known as supervised learning.</code> | <code>predict a value based on other values in the dataset. process of training a pred model is supervised learning.</code> | <code>1</code> |
  | <code>predict a value based on other values in the dataset. process of training a pred model is supervised learning.</code> | <code>involved with predicting a value based on other values in the dataset; process of training this type of model is known as supervised learning</code> | <code>1</code> |
  | <code>predicting one value (the target variable) using other values</code> | <code>predictive models are involved with predicting a value based on other values in the dataset.</code> | <code>1</code> |
* Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "pairwise_angle_sim"
  }
  ```
</details>
<details><summary>apt</summary>

#### apt

* Dataset: [apt](https://huggingface.co/datasets/tasksource/apt) at [f6c07f6](https://huggingface.co/datasets/tasksource/apt/tree/f6c07f66d3eccebd36418885ce10aff295d436dd)
* Size: 3,349 training samples
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
* Approximate statistics based on the first 1000 samples:
  |         | sentence1 | sentence2 | label |
  |:--------|:----------|:----------|:------|
  | type    | string | string | int |
  | details | <ul><li>min: 4 tokens</li><li>mean: 17.28 tokens</li><li>max: 124 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 16.99 tokens</li><li>max: 121 tokens</li></ul> | <ul><li>0: ~35.90%</li><li>1: ~64.10%</li></ul> |
* Samples:
  | sentence1 | sentence2 | label |
  |:----------|:----------|:------|
  | <code>Come on.</code> | <code>Come on</code> | <code>1</code> |
  | <code>In Washington, the federal government remained closed for a second day.</code> | <code>The federal government in Washington was closed for a second day running.</code> | <code>1</code> |
  | <code>The findings appear in next Friday's Physical Review Letters.</code> | <code>Results published next Friday</code> | <code>0</code> |
* Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "pairwise_angle_sim"
  }
  ```
</details>
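
As a rough illustration (not from the card), pair datasets such as parade and apt, with the `sentence1`/`sentence2`/`label` columns shown above, could be fed to `AnglELoss` along these lines; the dataset split name and the trainer wiring here are assumptions:

```python
from datasets import load_dataset
from sentence_transformers import SentenceTransformer, SentenceTransformerTrainer
from sentence_transformers.losses import AnglELoss

model = SentenceTransformer("jhu-clsp/ettin-encoder-32m")
# Assumed split and column names, matching the statistics tables above
train_dataset = load_dataset("tasksource/parade", split="train").select_columns(
    ["sentence1", "sentence2", "label"]
)
loss = AnglELoss(model, scale=20.0)  # optimizes pairwise_angle_sim, as in the JSON above

trainer = SentenceTransformerTrainer(model=model, train_dataset=train_dataset, loss=loss)
trainer.train()
```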
<details><summary>glue/stsb</summary>

#### glue/stsb

@@ -368,10 +424,10 @@ You can finetune this model on your own dataset.

* Size: 800,000 training samples
* Columns: <code>label</code>, <code>sentence1</code>, and <code>sentence2</code>
* Approximate statistics based on the first 1000 samples:
  |         | label | sentence1 | sentence2 |
  |:--------|:------|:----------|:----------|
  | type    | int | string | string |
  | details | <ul><li>0: ~51.20%</li><li>1: ~48.80%</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 62.72 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 7 tokens</li><li>mean: 8.01 tokens</li><li>max: 16 tokens</li></ul> |
* Samples:
  | label | sentence1 | sentence2 |
  |:------|:----------|:----------|

@@ -390,14 +446,14 @@ You can finetune this model on your own dataset.

#### tomaarsen/natural-questions-hard-negatives

* Dataset: tomaarsen/natural-questions-hard-negatives
* Size: 96,658 training samples
* Columns: <code>query</code>, <code>answer</code>, <code>negative_1</code>, <code>negative_2</code>, <code>negative_3</code>, <code>negative_4</code>, and <code>negative_5</code>
* Approximate statistics based on the first 1000 samples:
  |         | query | answer | negative_1 | negative_2 | negative_3 | negative_4 | negative_5 |
  |:--------|:------|:-------|:-----------|:-----------|:-----------|:-----------|:-----------|
  | type    | string | string | string | string | string | string | string |
  | details | <ul><li>min: 10 tokens</li><li>mean: 12.52 tokens</li><li>max: 26 tokens</li></ul> | <ul><li>min: 17 tokens</li><li>mean: 137.85 tokens</li><li>max: 556 tokens</li></ul> | <ul><li>min: 23 tokens</li><li>mean: 144.1 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 13 tokens</li><li>mean: 142.73 tokens</li><li>max: 832 tokens</li></ul> | <ul><li>min: 15 tokens</li><li>mean: 146.37 tokens</li><li>max: 649 tokens</li></ul> | <ul><li>min: 19 tokens</li><li>mean: 145.79 tokens</li><li>max: 549 tokens</li></ul> | <ul><li>min: 19 tokens</li><li>mean: 142.01 tokens</li><li>max: 574 tokens</li></ul> |
* Samples:
  | query | answer | negative_1 | negative_2 | negative_3 | negative_4 | negative_5 |
  |:------|:-------|:-----------|:-----------|:-----------|:-----------|:-----------|

@@ -418,7 +474,7 @@ You can finetune this model on your own dataset.

#### tomaarsen/gooaq-hard-negatives

* Dataset: tomaarsen/gooaq-hard-negatives
* Size: 800,000 training samples
* Columns: <code>question</code>, <code>answer</code>, <code>negative_1</code>, <code>negative_2</code>, <code>negative_3</code>, <code>negative_4</code>, and <code>negative_5</code>
* Approximate statistics based on the first 1000 samples:

@@ -446,7 +502,7 @@ You can finetune this model on your own dataset.

#### bclavie/msmarco-500k-triplets

* Dataset: bclavie/msmarco-500k-triplets
* Size: 500,000 training samples
* Columns: <code>query</code>, <code>positive</code>, and <code>negative</code>
* Approximate statistics based on the first 1000 samples:

@@ -474,7 +530,7 @@ You can finetune this model on your own dataset.

#### sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1

* Dataset: sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1
* Size: 800,000 training samples
* Columns: <code>query</code>, <code>positive</code>, and <code>negative</code>
* Approximate statistics based on the first 1000 samples:

@@ -502,7 +558,7 @@ You can finetune this model on your own dataset.

#### sentence-transformers/gooaq

* Dataset: sentence-transformers/gooaq
* Size: 800,000 training samples
* Columns: <code>question</code> and <code>answer</code>
* Approximate statistics based on the first 1000 samples:

@@ -530,14 +586,14 @@ You can finetune this model on your own dataset.

#### sentence-transformers/natural-questions

* Dataset: sentence-transformers/natural-questions
* Size: 100,231 training samples
* Columns: <code>query</code> and <code>answer</code>
* Approximate statistics based on the first 1000 samples:
  |         | query | answer |
  |:--------|:------|:-------|
  | type    | string | string |
  | details | <ul><li>min: 10 tokens</li><li>mean: 12.47 tokens</li><li>max: 23 tokens</li></ul> | <ul><li>min: 17 tokens</li><li>mean: 138.32 tokens</li><li>max: 556 tokens</li></ul> |
* Samples:
  | query | answer |
  |:------|:-------|

@@ -558,7 +614,7 @@ You can finetune this model on your own dataset.

#### sentence-transformers/quora-duplicates

* Dataset: sentence-transformers/quora-duplicates
* Size: 101,762 training samples
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
* Approximate statistics based on the first 1000 samples:

@@ -582,11 +638,96 @@ You can finetune this model on your own dataset.

  }
  ```
</details>
<details><summary>sentence-transformers/s2orc</summary>

#### sentence-transformers/s2orc

* Dataset: sentence-transformers/s2orc
* Size: 800,000 training samples
* Columns: <code>title</code> and <code>abstract</code>
* Approximate statistics based on the first 1000 samples:
  |         | title | abstract |
  |:--------|:------|:---------|
  | type    | string | string |
  | details | <ul><li>min: 6 tokens</li><li>mean: 20.08 tokens</li><li>max: 83 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 131.03 tokens</li><li>max: 332 tokens</li></ul> |
* Samples:
  | title | abstract |
  |:------|:---------|
  | <code>Syntheses, Structures and Properties of Two Transition Metal-Flexible Ligand Coordination Polymers</code> | <code>Two coordination polymers based on 3,5-bis(4-carboxyphenylmethyloxy) benzoic acid (H3L), [M(HL)]·2H2O M = Mn(1), Co(2), have been synthesized under hydrothermal conditions. Their structures have been determined by single-crystal X-ray diffraction and further characterized by elemental analysis, IR spectra and TGA. The two complexes possess 3D framework with diamond channels resulting from the trans-configuration of the flexible ligand and three coordination modes, 3(η2, η1), 2(η1, η1), η1, of carboxyl groups in the ligand. The framework can be represented with Schlafli symbol of (48·66)(47·66). The wall of the channel consists of left- or right-handed helical polymeric chains. UV–visible–NIR and photoluminescence spectra, magnetic properties of 1 and 2 have also been discussed.</code> |
  | <code>Discussion on the Influence and Development of Technical Aesthetics in Modern Landscape Design</code> | <code>The source of technical aesthetics was introduced and its meaning was explained.The relations between technical aesthetics and modern landscpae design were discussed.The embodiment of technical aesthetics in landscpae design was discussed in the aspects of new material,new technology,new structureand new apparatus.It was put forward that the the development direction of technical aesthetics were tending to sensibility, native land and zoology.</code> |
  | <code>GRIN optics for dual-band IR sensors (Conference Presentation)</code> | <code>Graded index (GRIN) optics offer potential for both weight savings and increased performance but have until recently been limited to visible and NIR bands (wavelengths shorter than about 0.9 µm). NRL has developed glass-based IR-GRIN lenses compatible with SWIR-LWIR wavebands. Recent designs show the potential for significant SWaP reduction benefits and improved performance using IR-GRIN lens elements in dual-band, MWIR-LWIR sensors. The SWaP and performance advantages of IR-GRIN lenses in platform-relevant dual-band imagers will be presented.</code> |
* Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "cos_sim",
      "mini_batch_size": 32,
      "gather_across_devices": false
  }
  ```
</details>
<details><summary>sentence-transformers/codesearchnet</summary>

#### sentence-transformers/codesearchnet

* Dataset: sentence-transformers/codesearchnet
* Size: 800,000 training samples
* Columns: <code>comment</code> and <code>code</code>
* Approximate statistics based on the first 1000 samples:
  |         | comment | code |
  |:--------|:--------|:-----|
  | type    | string | string |
  | details | <ul><li>min: 3 tokens</li><li>mean: 28.98 tokens</li><li>max: 142 tokens</li></ul> | <ul><li>min: 30 tokens</li><li>mean: 166.72 tokens</li><li>max: 1024 tokens</li></ul> |
* Samples:
  | comment | code |
  |:--------|:-----|
  | <code>Computes the new parent id for the node being moved.<br><br>@return int</code> | <code>protected function parentId()<br> {<br> switch ( $this->position )<br> {<br> case 'root':<br> return null;<br><br> case 'child':<br> return $this->target->getKey();<br><br> default:<br> return $this->target->getParentId();<br> }<br> }</code> |
  | <code>// SetWinSize overwrites the playlist's window size.</code> | <code>func (p *MediaPlaylist) SetWinSize(winsize uint) error {<br> if winsize > p.capacity {<br> return errors.New("capacity must be greater than winsize or equal")<br> }<br> p.winsize = winsize<br> return nil<br>}</code> |
  | <code>Show the sidebar and squish the container to make room for the sidebar.<br>If hideOthers is true, hide other open sidebars.</code> | <code>function() {<br> var options = this.options;<br><br> if (options.hideOthers) {<br> this.secondary.each(function() {<br> var sidebar = $(this);<br><br> if (sidebar.hasClass('is-expanded')) {<br> sidebar.toolkit('offCanvas', 'hide');<br> }<br> });<br> }<br><br> this.fireEvent('showing');<br><br> this.container.addClass('move-' + this.opposite);<br><br> this.element<br> .reveal()<br> .addClass('is-expanded')<br> .aria('expanded', true);<br><br> if (options.stopScroll) {<br> $('body').addClass('no-scroll');<br> }<br><br> this.fireEvent('shown');<br> }</code> |
* Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "cos_sim",
      "mini_batch_size": 32,
      "gather_across_devices": false
  }
  ```
</details>
<details><summary>sentence-transformers/stackexchange-duplicates</summary>

#### sentence-transformers/stackexchange-duplicates

* Dataset: sentence-transformers/stackexchange-duplicates
* Size: 250,460 training samples
* Columns: <code>body1</code> and <code>body2</code>
* Approximate statistics based on the first 1000 samples:
  |         | body1 | body2 |
  |:--------|:------|:------|
  | type    | string | string |
  | details | <ul><li>min: 13 tokens</li><li>mean: 174.01 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 11 tokens</li><li>mean: 156.88 tokens</li><li>max: 1024 tokens</li></ul> |
* Samples:
  | body1 | body2 |
  |:------|:------|
</details>
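
For intuition (this sketch is not from the card), `CachedMultipleNegativesRankingLoss` is a memory-efficient version of the standard in-batch-negatives objective, which can be written as a cross-entropy over scaled cosine similarities:

```python
import torch
import torch.nn.functional as F

def in_batch_negatives_loss(anchors: torch.Tensor, positives: torch.Tensor, scale: float = 20.0) -> torch.Tensor:
    """anchors, positives: (batch, dim) L2-normalized embeddings; positives[i] pairs with anchors[i]."""
    scores = scale * anchors @ positives.T  # cosine similarity of every anchor against every positive
    labels = torch.arange(scores.size(0), device=scores.device)
    # The diagonal holds the true pairs; every other positive in the batch acts as a negative.
    return F.cross_entropy(scores, labels)
```

The cached variant computes the same loss while streaming the batch through the encoder in chunks of 32 (the `mini_batch_size` above), which is what makes large effective batch sizes affordable.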

### Training Hyperparameters
#### Non-Default Hyperparameters

- `per_device_train_batch_size`:
- `weight_decay`: 1e-06
- `num_train_epochs`: 2
- `warmup_ratio`: 0.1

@@ -600,14 +741,14 @@ You can finetune this model on your own dataset.

- `do_predict`: False
- `eval_strategy`: no
- `prediction_loss_only`: True
- `per_device_train_batch_size`:
- `per_device_eval_batch_size`: 8
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`:
- `weight_decay`: 1e-06
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999

@@ -633,7 +774,6 @@ You can finetune this model on your own dataset.

- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `bf16`: False
- `fp16`: True
- `fp16_opt_level`: O1

@@ -660,6 +800,7 @@ You can finetune this model on your own dataset.

- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch

@@ -667,6 +808,8 @@ You can finetune this model on your own dataset.

- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False

@@ -699,7 +842,7 @@ You can finetune this model on your own dataset.

- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`:
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False

@@ -707,7 +850,7 @@ You can finetune this model on your own dataset.

- `use_liger_kernel`: False
- `liger_kernel_config`: None
- `eval_use_gather_object`: False
- `average_tokens_across_devices`:
- `prompts`: None
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: proportional
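
For readers reproducing a similar run, a hedged sketch of how the non-default values above map onto `SentenceTransformerTrainingArguments`; the batch size and learning rate are truncated in this card, so those two values below are placeholders, not the actual ones:

```python
from sentence_transformers import SentenceTransformerTrainingArguments

args = SentenceTransformerTrainingArguments(
    output_dir="ettin-32m-embed",    # hypothetical output path
    per_device_train_batch_size=64,  # placeholder: the actual value is truncated above
    learning_rate=2e-5,              # placeholder: the actual value is truncated above
    weight_decay=1e-06,
    num_train_epochs=2,
    warmup_ratio=0.1,
    fp16=True,
    multi_dataset_batch_sampler="proportional",
)
```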

@@ -719,50 +862,114 @@ You can finetune this model on your own dataset.

### Training Logs

| Epoch | Step | Training Loss |
|:------|:-----|:--------------|

### Framework Versions
- Python: 3.12.10
- Sentence Transformers: 5.1.2
- Transformers: 4.
- PyTorch: 2.7.1+cu126
- Accelerate: 1.7.0
- Datasets: 3.6.0
- Tokenizers: 0.

## Citation
| 7 |
- feature-extraction
|
| 8 |
- dense
|
| 9 |
- generated_from_trainer
|
| 10 |
+
- dataset_size:6331245
|
| 11 |
- loss:AnglELoss
|
| 12 |
- loss:CoSENTLoss
|
| 13 |
- loss:CachedMultipleNegativesRankingLoss
|
| 14 |
base_model: jhu-clsp/ettin-encoder-32m
|
| 15 |
widget:
|
| 16 |
+
- source_sentence: what is paediatric clinical psychology
|
| 17 |
sentences:
|
| 18 |
+
- Pediatric neuropsychology (paediatric in the UK) is a sub-speciality within the
|
| 19 |
+
field of clinical neuropsychology that studies the relationship between brain
|
| 20 |
+
health and behaviour in children.any pediatric neuropsychologists are involved
|
| 21 |
+
in teaching, research, supervision, and training of undergraduate and graduate
|
| 22 |
+
students in the field. In the United States undergraduate and graduate psychology
|
| 23 |
+
programs generally do not offer a track in pediatric neuropsychology, per se.
|
| 24 |
+
- "â\x80\x9CRealâ\x80\x9D hummus, should contain about 175 calories, out of which\
|
| 25 |
+
\ 70-80 calories are contributed by fat. The average Israeli eats 8-10 kilograms\
|
| 26 |
+
\ (18-22 pounds) of hummus every year, so weâ\x80\x99re talking about extra 15,000\
|
| 27 |
+
\ calories which can make him gain about 2.5kg of body weight each year. So you\
|
| 28 |
+
\ can see how excessive consumption of the packaged product might be fattening\
|
| 29 |
+
\ over the years. The common serving size of hummus (real hummus, that is), which\
|
| 30 |
+
\ is around one cup (220-240g) may contain 400-450 calories. And every pita (â\x80\
|
| 31 |
+
\x9Cpita breadâ\x80\x9D) contains another 270, so itâ\x80\x99s not really â\x80\
|
| 32 |
+
\x9Cdietaryâ\x80\x9D."
|
| 33 |
+
- "Pediatrics (also spelled paediatrics or pædiatrics) is the branch of medicine\
|
| 34 |
+
\ that involves the medical care of infants, children, and adolescents. The American\
|
| 35 |
+
\ Academy of Pediatrics recommends people be under pediatric care up to the age\
|
| 36 |
+
\ of 21.[1] A medical practitioner who specializes in this area is known as a\
|
| 37 |
+
\ pediatrician, or paediatrician. The word pediatrics and its cognates mean healer\
|
| 38 |
+
\ of children; they derive from two Greek words: Ï\x80αá¿\x96Ï\x82 (pais child)\
|
| 39 |
+
\ and ἰαÏ\x84Ï\x81Ï\x8CÏ\x82 (iatros doctor, healer)."
|
| 40 |
+
- source_sentence: These ancient rites are rarely performed in contemporary Sri Lanka
|
| 41 |
+
, but the conserved songs are still performed by folk musicians .
|
| 42 |
sentences:
|
| 43 |
+
- In 1971 , a main campus was completed in 33 MacDonnell Road for the new school
|
| 44 |
+
.
|
| 45 |
+
- These ancient rites are still performed in contemporary Sri Lanka , but the preserved
|
| 46 |
+
songs are rarely performed by folk musicians .
|
| 47 |
+
- After May 4 , 2012 , Gordon M. Snow was replaced by Joseph M. Demarest and then
|
| 48 |
+
Michael S. Welch with limited formal announcement .
|
| 49 |
+
- source_sentence: A woman is playing the flute.
|
| 50 |
sentences:
|
| 51 |
+
- A boy is playing the trumpet.
|
| 52 |
+
- A man tries to read the paper.
|
| 53 |
+
- A man is playing the guitar.
|
| 54 |
+
- source_sentence: Interference now on all our scans.
|
| 55 |
sentences:
|
| 56 |
+
- Would you permit me to explain this Polly?
|
| 57 |
+
- All Ourscans are jammed.
|
| 58 |
+
- The aircraft family was first introduced at the Paris Air Show in 1999.
|
| 59 |
+
- source_sentence: why has chs invested in da?
|
| 60 |
sentences:
|
| 61 |
+
- In order to renew the strategic road map to CHS's growth, CHS partnered with DA
|
| 62 |
+
to improve outcomes rather than increasing its size. Most of DA's capacity was
|
| 63 |
+
used to provide tools in order to support CHS-affiliated hospitals in delivering
|
| 64 |
+
best-in-class healthcare to patients.
|
| 65 |
+
- You can in theory add every enchantment that is compatible with a tool/weapon/armor
|
| 66 |
+
onto the same item. The bow can have these 7 enchantments, though mending and
|
| 67 |
+
infinity are mutually exclusive. So you can have up to 6 different enchantments
|
| 68 |
+
on a bow using an anvil.
|
| 69 |
+
- 'Clean up is a phrasal verb which means: to make (a room or space) clean and orderly.
|
| 70 |
+
... Clean out is a phrasal verb which means something such as a cupboard, room,
|
| 71 |
+
or container, you take everything out of it and clean the inside of it thoroughly.
|
| 72 |
+
Secondly, "clean"is a simple word which is often used in our daily life.'
|
| 73 |
datasets:
|
| 74 |
- google-research-datasets/paws
|
| 75 |
- nyu-mll/glue
|
| 76 |
- mwong/fever-evidence-related
|
| 77 |
+
- tasksource/parade
|
| 78 |
+
- tasksource/apt
|
| 79 |
- tasksource/sts-companion
|
| 80 |
- tasksource/zero-shot-label-nli
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 81 |
pipeline_tag: sentence-similarity
|
| 82 |
library_name: sentence-transformers
|
| 83 |
---
|
| 84 |
|
| 85 |
# SentenceTransformer based on jhu-clsp/ettin-encoder-32m
|
| 86 |
|
| 87 |
+
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [jhu-clsp/ettin-encoder-32m](https://huggingface.co/jhu-clsp/ettin-encoder-32m) on 19 datasets. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
|
| 88 |
|
| 89 |
## Model Details
|
| 90 |
|
| 91 |
### Model Description
|
| 92 |
- **Model Type:** Sentence Transformer
|
| 93 |
- **Base model:** [jhu-clsp/ettin-encoder-32m](https://huggingface.co/jhu-clsp/ettin-encoder-32m) <!-- at revision 1b8ba06455dd44f80fc9c1ca9e22806157a57379 -->
|
| 94 |
+
- **Maximum Sequence Length:** 1024 tokens
|
| 95 |
- **Output Dimensionality:** 384 dimensions
|
| 96 |
- **Similarity Function:** Cosine Similarity
|
| 97 |
- **Training Datasets:**
|
| 98 |
- [paws/labeled_final](https://huggingface.co/datasets/paws)
|
| 99 |
- [glue/mrpc](https://huggingface.co/datasets/glue)
|
| 100 |
- [fever-evidence-related](https://huggingface.co/datasets/mwong/fever-evidence-related)
|
| 101 |
+
- [parade](https://huggingface.co/datasets/tasksource/parade)
|
| 102 |
+
- [apt](https://huggingface.co/datasets/tasksource/apt)
|
| 103 |
- [glue/stsb](https://huggingface.co/datasets/glue)
|
| 104 |
- sick/relatedness
|
| 105 |
- [sts-companion](https://huggingface.co/datasets/tasksource/sts-companion)
|
| 106 |
- [zero-shot-label-nli](https://huggingface.co/datasets/tasksource/zero-shot-label-nli)
|
| 107 |
+
- tomaarsen/natural-questions-hard-negatives
|
| 108 |
+
- tomaarsen/gooaq-hard-negatives
|
| 109 |
+
- bclavie/msmarco-500k-triplets
|
| 110 |
+
- sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1
|
| 111 |
+
- sentence-transformers/gooaq
|
| 112 |
+
- sentence-transformers/natural-questions
|
| 113 |
+
- sentence-transformers/quora-duplicates
|
| 114 |
+
- sentence-transformers/s2orc
|
| 115 |
+
- sentence-transformers/codesearchnet
|
| 116 |
+
- sentence-transformers/stackexchange-duplicates
|
| 117 |
- **Language:** en
|
| 118 |
<!-- - **License:** Unknown -->
|
| 119 |
|
|
|
|
| 127 |
|
| 128 |
```
|
| 129 |
SentenceTransformer(
|
| 130 |
+
(0): Transformer({'max_seq_length': 1024, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
|
| 131 |
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
|
| 132 |
(2): Normalize()
|
| 133 |
)
|
|
|
|
| 151 |
model = SentenceTransformer("tasksource/ettin-32m-embed")
|
| 152 |
# Run inference
|
| 153 |
queries = [
|
| 154 |
+
"why has chs invested in da?",
|
| 155 |
]
|
| 156 |
documents = [
|
| 157 |
+
"In order to renew the strategic road map to CHS's growth, CHS partnered with DA to improve outcomes rather than increasing its size. Most of DA's capacity was used to provide tools in order to support CHS-affiliated hospitals in delivering best-in-class healthcare to patients.",
|
| 158 |
+
'You can in theory add every enchantment that is compatible with a tool/weapon/armor onto the same item. The bow can have these 7 enchantments, though mending and infinity are mutually exclusive. So you can have up to 6 different enchantments on a bow using an anvil.',
|
| 159 |
+
'Clean up is a phrasal verb which means: to make (a room or space) clean and orderly. ... Clean out is a phrasal verb which means something such as a cupboard, room, or container, you take everything out of it and clean the inside of it thoroughly. Secondly, "clean"is a simple word which is often used in our daily life.',
|
| 160 |
]
|
| 161 |
query_embeddings = model.encode_query(queries)
|
| 162 |
document_embeddings = model.encode_document(documents)
|
|
|
|
| 166 |
# Get the similarity scores for the embeddings
|
| 167 |
similarities = model.similarity(query_embeddings, document_embeddings)
|
| 168 |
print(similarities)
|
| 169 |
+
# tensor([[ 0.6237, -0.0022, -0.1018]])
|
| 170 |
```
|
| 171 |
|
| 172 |
<!--
|
|
|
|
| 268 |
* Size: 403,218 training samples
|
| 269 |
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
|
| 270 |
* Approximate statistics based on the first 1000 samples:
|
| 271 |
+
| | sentence1 | sentence2 | label |
|
| 272 |
+
|:--------|:----------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|:------------------------------------------------|
|
| 273 |
+
| type | string | string | int |
|
| 274 |
+
| details | <ul><li>min: 6 tokens</li><li>mean: 13.92 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>min: 33 tokens</li><li>mean: 316.81 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>0: ~29.20%</li><li>1: ~70.80%</li></ul> |
|
| 275 |
* Samples:
|
| 276 |
| sentence1 | sentence2 | label |
|
| 277 |
|:-----------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
|
|
|
|
| 286 |
}
|
| 287 |
```
|
| 288 |
</details>
|
| 289 |
+
<details><summary>parade</summary>
|
| 290 |
+
|
| 291 |
+
#### parade
|
| 292 |
+
|
| 293 |
+
* Dataset: [parade](https://huggingface.co/datasets/tasksource/parade) at [466978f](https://huggingface.co/datasets/tasksource/parade/tree/466978f31aebf4d052287f32ea3ae393f178f386)
|
| 294 |
+
* Size: 7,550 training samples
|
| 295 |
+
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
|
| 296 |
+
* Approximate statistics based on the first 1000 samples:
|
| 297 |
+
| | sentence1 | sentence2 | label |
|
| 298 |
+
|:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:------------------------------------------------|
|
| 299 |
+
| type | string | string | int |
|
| 300 |
+
| details | <ul><li>min: 6 tokens</li><li>mean: 21.97 tokens</li><li>max: 61 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 21.81 tokens</li><li>max: 49 tokens</li></ul> | <ul><li>0: ~57.10%</li><li>1: ~42.90%</li></ul> |
|
| 301 |
+
* Samples:
|
| 302 |
+
| sentence1 | sentence2 | label |
|
| 303 |
+
|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
|
| 304 |
+
| <code>predictive models are involved with predicting a value based on other values in the dataset. the process of training a predictive model is known as supervised learning.</code> | <code>predict a value based on other values in the dataset. process of training a pred model is supervised learning.</code> | <code>1</code> |
|
| 305 |
+
| <code>predict a value based on other values in the dataset. process of training a pred model is supervised learning.</code> | <code>involved with predicting a value based on other values in the dataset; process of training this type of model is known as supervised learning</code> | <code>1</code> |
|
| 306 |
+
| <code>predicting one value (the target variable) using other values</code> | <code>predictive models are involved with predicting a value based on other values in the dataset.</code> | <code>1</code> |
|
| 307 |
+
* Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
|
| 308 |
+
```json
|
| 309 |
+
{
|
| 310 |
+
"scale": 20.0,
|
| 311 |
+
"similarity_fct": "pairwise_angle_sim"
|
| 312 |
+
}
|
| 313 |
+
```
|
| 314 |
+
</details>
|
| 315 |
+
<details><summary>apt</summary>
|
| 316 |
+
|
| 317 |
+
#### apt
|
| 318 |
+
|
| 319 |
+
* Dataset: [apt](https://huggingface.co/datasets/tasksource/apt) at [f6c07f6](https://huggingface.co/datasets/tasksource/apt/tree/f6c07f66d3eccebd36418885ce10aff295d436dd)
|
| 320 |
+
* Size: 3,349 training samples
|
| 321 |
+
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
|
| 322 |
+
* Approximate statistics based on the first 1000 samples:
|
| 323 |
+
| | sentence1 | sentence2 | label |
|
| 324 |
+
|:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:------------------------------------------------|
|
| 325 |
+
| type | string | string | int |
|
| 326 |
+
| details | <ul><li>min: 4 tokens</li><li>mean: 17.28 tokens</li><li>max: 124 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 16.99 tokens</li><li>max: 121 tokens</li></ul> | <ul><li>0: ~35.90%</li><li>1: ~64.10%</li></ul> |
|
| 327 |
+
* Samples:
|
| 328 |
+
| sentence1 | sentence2 | label |
|
| 329 |
+
|:-------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------|:---------------|
|
| 330 |
+
| <code>Come on.</code> | <code>Come on</code> | <code>1</code> |
|
| 331 |
+
| <code>In Washington, the federal government remained closed for a second day.</code> | <code>The federal government in Washington was closed for a second day running.</code> | <code>1</code> |
|
| 332 |
+
| <code>The findings appear in next Friday's Physical Review Letters.</code> | <code>Results published next Friday</code> | <code>0</code> |
|
| 333 |
+
* Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
|
| 334 |
+
```json
|
| 335 |
+
{
|
| 336 |
+
"scale": 20.0,
|
| 337 |
+
"similarity_fct": "pairwise_angle_sim"
|
| 338 |
+
}
|
| 339 |
+
```
|
| 340 |
+
</details>
|
| 341 |
<details><summary>glue/stsb</summary>
|
| 342 |
|
| 343 |
#### glue/stsb
|
|
|
|
| 424 |
* Size: 800,000 training samples
|
| 425 |
* Columns: <code>label</code>, <code>sentence1</code>, and <code>sentence2</code>
|
| 426 |
* Approximate statistics based on the first 1000 samples:
|
| 427 |
+
| | label | sentence1 | sentence2 |
|
| 428 |
+
|:--------|:------------------------------------------------|:------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|
|
| 429 |
+
| type | int | string | string |
|
| 430 |
+
| details | <ul><li>0: ~51.20%</li><li>1: ~48.80%</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 62.72 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 7 tokens</li><li>mean: 8.01 tokens</li><li>max: 16 tokens</li></ul> |
|
| 431 |
* Samples:
|
| 432 |
| label | sentence1 | sentence2 |
|
| 433 |
|:---------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------|
|
|
|
|
| 446 |
|
| 447 |
#### tomaarsen/natural-questions-hard-negatives
|
| 448 |
|
| 449 |
+
* Dataset: tomaarsen/natural-questions-hard-negatives
|
| 450 |
* Size: 96,658 training samples
|
| 451 |
* Columns: <code>query</code>, <code>answer</code>, <code>negative_1</code>, <code>negative_2</code>, <code>negative_3</code>, <code>negative_4</code>, and <code>negative_5</code>
|
| 452 |
* Approximate statistics based on the first 1000 samples:
|
| 453 |
| | query | answer | negative_1 | negative_2 | negative_3 | negative_4 | negative_5 |
|
| 454 |
|:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
|
| 455 |
| type | string | string | string | string | string | string | string |
|
| 456 |
+
| details | <ul><li>min: 10 tokens</li><li>mean: 12.52 tokens</li><li>max: 26 tokens</li></ul> | <ul><li>min: 17 tokens</li><li>mean: 137.85 tokens</li><li>max: 556 tokens</li></ul> | <ul><li>min: 23 tokens</li><li>mean: 144.1 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 13 tokens</li><li>mean: 142.73 tokens</li><li>max: 832 tokens</li></ul> | <ul><li>min: 15 tokens</li><li>mean: 146.37 tokens</li><li>max: 649 tokens</li></ul> | <ul><li>min: 19 tokens</li><li>mean: 145.79 tokens</li><li>max: 549 tokens</li></ul> | <ul><li>min: 19 tokens</li><li>mean: 142.01 tokens</li><li>max: 574 tokens</li></ul> |
|
| 457 |
* Samples:
|
| 458 |
| query | answer | negative_1 | negative_2 | negative_3 | negative_4 | negative_5 |
|
| 459 |
|:----------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
|
|
|
| 474 |
|
| 475 |
#### tomaarsen/gooaq-hard-negatives
|
| 476 |
|
| 477 |
+
* Dataset: tomaarsen/gooaq-hard-negatives
|
| 478 |
* Size: 800,000 training samples
|
| 479 |
* Columns: <code>question</code>, <code>answer</code>, <code>negative_1</code>, <code>negative_2</code>, <code>negative_3</code>, <code>negative_4</code>, and <code>negative_5</code>
|
| 480 |
* Approximate statistics based on the first 1000 samples:
|
|
|
|
| 502 |
|
| 503 |
#### bclavie/msmarco-500k-triplets
|
| 504 |
|
| 505 |
+
* Dataset: bclavie/msmarco-500k-triplets
|
| 506 |
* Size: 500,000 training samples
|
| 507 |
* Columns: <code>query</code>, <code>positive</code>, and <code>negative</code>
|
| 508 |
* Approximate statistics based on the first 1000 samples:
|
|
|
|
| 530 |
|
| 531 |
#### sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1
|
| 532 |
|
| 533 |
+
* Dataset: sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1
|
| 534 |
* Size: 800,000 training samples
|
| 535 |
* Columns: <code>query</code>, <code>positive</code>, and <code>negative</code>
|
| 536 |
* Approximate statistics based on the first 1000 samples:
|
|
|
|
| 558 |
|
| 559 |
#### sentence-transformers/gooaq
|
| 560 |
|
| 561 |
+
* Dataset: sentence-transformers/gooaq
|
| 562 |
* Size: 800,000 training samples
|
| 563 |
* Columns: <code>question</code> and <code>answer</code>
|
| 564 |
* Approximate statistics based on the first 1000 samples:
|
|
|
|
| 586 |
|
| 587 |
#### sentence-transformers/natural-questions
|
| 588 |
|
| 589 |
+
* Dataset: sentence-transformers/natural-questions
|
| 590 |
* Size: 100,231 training samples
|
| 591 |
* Columns: <code>query</code> and <code>answer</code>
|
| 592 |
* Approximate statistics based on the first 1000 samples:
|
| 593 |
| | query | answer |
|
| 594 |
|:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
|
| 595 |
| type | string | string |
|
| 596 |
+
| details | <ul><li>min: 10 tokens</li><li>mean: 12.47 tokens</li><li>max: 23 tokens</li></ul> | <ul><li>min: 17 tokens</li><li>mean: 138.32 tokens</li><li>max: 556 tokens</li></ul> |
|
| 597 |
* Samples:
|
| 598 |
| query | answer |
|
| 599 |
|:----------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
|
|
|
| 614 |
|
| 615 |
#### sentence-transformers/quora-duplicates
|
| 616 |
|
| 617 |
+
* Dataset: sentence-transformers/quora-duplicates
|
| 618 |
* Size: 101,762 training samples
|
| 619 |
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
|
| 620 |
* Approximate statistics based on the first 1000 samples:
|
|
|
|
| 638 |
}
|
| 639 |
```
|
| 640 |
</details>
|
| 641 |
+
<details><summary>sentence-transformers/s2orc</summary>
|
| 642 |
+
|
| 643 |
+
#### sentence-transformers/s2orc
|
| 644 |
+
|
| 645 |
+
* Dataset: sentence-transformers/s2orc
|
| 646 |
+
* Size: 800,000 training samples
|
| 647 |
+
* Columns: <code>title</code> and <code>abstract</code>
|
| 648 |
+
* Approximate statistics based on the first 1000 samples:
|
| 649 |
+
| | title | abstract |
|
| 650 |
+
|:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
|
| 651 |
+
| type | string | string |
|
| 652 |
+
| details | <ul><li>min: 6 tokens</li><li>mean: 20.08 tokens</li><li>max: 83 tokens</li></ul> | <ul><li>min: 18 tokens</li><li>mean: 131.03 tokens</li><li>max: 332 tokens</li></ul> |
|
| 653 |
+
* Samples:
|
| 654 |
+
| title | abstract |
|
| 655 |
+
|:----------------------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
| 656 |
+
| <code>Syntheses, Structures and Properties of Two Transition Metal-Flexible Ligand Coordination Polymers</code> | <code>Two coordination polymers based on 3,5-bis(4-carboxyphenylmethyloxy) benzoic acid (H3L), [M(HL)]·2H2O M = Mn(1), Co(2), have been synthesized under hydrothermal conditions. Their structures have been determined by single-crystal X-ray diffraction and further characterized by elemental analysis, IR spectra and TGA. The two complexes possess 3D framework with diamond channels resulting from the trans-configuration of the flexible ligand and three coordination modes, 3(η2, η1), 2(η1, η1), η1, of carboxyl groups in the ligand. The framework can be represented with Schlafli symbol of (48·66)(47·66). The wall of the channel consists of left- or right-handed helical polymeric chains. UV–visible–NIR and photoluminescence spectra, magnetic properties of 1 and 2 have also been discussed.</code> |
|
| 657 |
+
| <code>Discussion on the Influence and Development of Technical Aesthetics in Modern Landscape Design</code> | <code>The source of technical aesthetics was introduced and its meaning was explained.The relations between technical aesthetics and modern landscpae design were discussed.The embodiment of technical aesthetics in landscpae design was discussed in the aspects of new material,new technology,new structureand new apparatus.It was put forward that the the development direction of technical aesthetics were tending to sensibility, native land and zoology.</code> |
|
| 658 |
+
| <code>GRIN optics for dual-band IR sensors (Conference Presentation)</code> | <code>Graded index (GRIN) optics offer potential for both weight savings and increased performance but have until recently been limited to visible and NIR bands (wavelengths shorter than about 0.9 µm). NRL has developed glass-based IR-GRIN lenses compatible with SWIR-LWIR wavebands. Recent designs show the potential for significant SWaP reduction benefits and improved performance using IR-GRIN lens elements in dual-band, MWIR-LWIR sensors. The SWaP and performance advantages of IR-GRIN lenses in platform-relevant dual-band imagers will be presented.</code> |
|
| 659 |
+
* Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
|
| 660 |
+
```json
|
| 661 |
+
{
|
| 662 |
+
"scale": 20.0,
|
| 663 |
+
"similarity_fct": "cos_sim",
|
| 664 |
+
"mini_batch_size": 32,
|
| 665 |
+
"gather_across_devices": false
|
| 666 |
+
}
|
| 667 |
+
```
|
| 668 |
+
</details>
|
| 669 |
+
<details><summary>sentence-transformers/codesearchnet</summary>
|
| 670 |
+
|
| 671 |
+
#### sentence-transformers/codesearchnet
|
| 672 |
+
|
| 673 |
+
* Dataset: sentence-transformers/codesearchnet
|
| 674 |
+
* Size: 800,000 training samples
|
| 675 |
+
* Columns: <code>comment</code> and <code>code</code>
|
| 676 |
+
* Approximate statistics based on the first 1000 samples:
|
| 677 |
+
| | comment | code |
|
| 678 |
+
|:--------|:-----------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|
|
| 679 |
+
| type | string | string |
|
| 680 |
+
| details | <ul><li>min: 3 tokens</li><li>mean: 28.98 tokens</li><li>max: 142 tokens</li></ul> | <ul><li>min: 30 tokens</li><li>mean: 166.72 tokens</li><li>max: 1024 tokens</li></ul> |
|
| 681 |
+
* Samples:
|
| 682 |
+
| comment | code |
|
| 683 |
+
|:-----------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <code>Computes the new parent id for the node being moved.<br><br>@return int</code> | <code>protected function parentId()<br> {<br> switch ( $this->position )<br> {<br> case 'root':<br> return null;<br><br> case 'child':<br> return $this->target->getKey();<br><br> default:<br> return $this->target->getParentId();<br> }<br> }</code> |
| <code>// SetWinSize overwrites the playlist's window size.</code> | <code>func (p *MediaPlaylist) SetWinSize(winsize uint) error {<br> if winsize > p.capacity {<br> return errors.New("capacity must be greater than winsize or equal")<br> }<br> p.winsize = winsize<br> return nil<br>}</code> |
| <code>Show the sidebar and squish the container to make room for the sidebar.<br>If hideOthers is true, hide other open sidebars.</code> | <code>function() {<br> var options = this.options;<br><br> if (options.hideOthers) {<br> this.secondary.each(function() {<br> var sidebar = $(this);<br><br> if (sidebar.hasClass('is-expanded')) {<br> sidebar.toolkit('offCanvas', 'hide');<br> }<br> });<br> }<br><br> this.fireEvent('showing');<br><br> this.container.addClass('move-' + this.opposite);<br><br> this.element<br> .reveal()<br> .addClass('is-expanded')<br> .aria('expanded', true);<br><br> if (options.stopScroll) {<br> $('body').addClass('no-scroll');<br> }<br><br> this.fireEvent('shown');<br> }</code> |
* Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "cos_sim",
      "mini_batch_size": 32,
      "gather_across_devices": false
  }
  ```
</details>
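
To inspect these (comment, code) pairs yourself, here is a minimal sketch assuming the public `sentence-transformers/codesearchnet` dataset listed above (the split name is an assumption, not taken from this run):

```python
from datasets import load_dataset

# Each row pairs a docstring-style comment with the code it documents.
ds = load_dataset("sentence-transformers/codesearchnet", split="train")
print(ds[0]["comment"])
print(ds[0]["code"])
```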
<details><summary>sentence-transformers/stackexchange-duplicates</summary>
#### sentence-transformers/stackexchange-duplicates
* Dataset: sentence-transformers/stackexchange-duplicates
* Size: 250,460 training samples
* Columns: <code>body1</code> and <code>body2</code>
* Approximate statistics based on the first 1000 samples:
| | body1 | body2 |
|:--------|:--------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|
| type | string | string |
| details | <ul><li>min: 13 tokens</li><li>mean: 174.01 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>min: 11 tokens</li><li>mean: 156.88 tokens</li><li>max: 1024 tokens</li></ul> |
* Samples:
| body1 | body2 |
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <code>I've been wondering about this for years. It seems like a pretty obvious question, so I'm surprised not to have found it addressed among the other Tolkien minutiae on this site. Hopefully I haven't missed it, but anyway, here goes... In Tolkien's Middle-Earth writings, Evil cannot create things, only twist and warp what already exists. Thus, Orcs are twisted Elves, Trolls are twisted Ents, etc. So then, what's the original source for Dragons? They look pretty original to me! The only template that seems even remotely possible is the Eagles, as they're both powerful fliers, but the connection seems very remote indeed. Also, as twisted copies Orcs and Trolls are markedly inferior to Elves and Ents respectively, but I'm not aware of any text describing Dragons as inferior to Eagles.</code> | <code>All that I know of Smaug is that he (she?) came out of nowhere to attack and conquer Erebor. Where exactly did he come from? In fact, what are the origins of dragons? Did Ilúvatar create them or did they come from somewhere else?</code> |
| <code>Hi i have some data which coming out from database in form of table like this, first i match some data with searching and then display it on page now i need to download it as csv file format please help me check my code and i'm new in php. please check image too for the reference and please please help me //import.php // echo "<pre>"; //print_r($_POST);die(); $keyword = $_POST['keyword']; $csvname = $_POST['csv_file']; ?> <table border ="1"> <thead> <tr> <th>id</th> <th>title</th> <th>count</th> </tr> </thead> <?php $row = 0; if (($handle = fopen("idata.csv", "r",)) !== FALSE) { while (($data = fgetcsv($handle, 1000, ",")) !== FALSE) { $num = count($data); // echo "<p> $num fields in line $row: <br /></p>\n"; $row++; for ($c=0; $c < $num; $c++) { // echo $data[$c] . "<br...</code> | <code>What is the most efficient way to convert a MySQL query to CSV in PHP please? It would be best to avoid temp files as this reduces portability (dir paths and setting file-system permissions required). The CSV should also include one top line of field names.</code> |
| <code>Following along in tutorials I see the blur filter being used. I am using Blender 2.69 and I can't locate it visually or even with a search. Actually, there is no "Filters" category at all. Do I have to download something to get it?</code> | <code>I have been following tutorial until I started adding nodes. The problem is that he has completely different nodes than I have. Even nodes that are created at start are different (I have Material and Output and he has Render Layers and Composite). Have I missed something or should I use different nodes than he?</code> |
* Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "cos_sim",
      "mini_batch_size": 32,
      "gather_across_devices": false
  }
  ```
</details>
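
Because several datasets are combined in this run, training goes through the multi-dataset interface of `SentenceTransformerTrainer`: a dictionary of named datasets paired with a dictionary of losses. Below is a minimal sketch with two of the datasets above (the subset and split names are assumptions, not taken from this run):

```python
from datasets import load_dataset
from sentence_transformers import (
    SentenceTransformer,
    SentenceTransformerTrainer,
    util,
)
from sentence_transformers.losses import CachedMultipleNegativesRankingLoss

model = SentenceTransformer("jhu-clsp/ettin-encoder-32m")
loss = CachedMultipleNegativesRankingLoss(
    model, scale=20.0, similarity_fct=util.cos_sim, mini_batch_size=32
)

# One entry per training dataset; the trainer samples batches across them.
train_dataset = {
    "codesearchnet": load_dataset(
        "sentence-transformers/codesearchnet", split="train"
    ),
    "stackexchange-duplicates": load_dataset(
        "sentence-transformers/stackexchange-duplicates",
        "body-body-pair",  # assumed subset matching the body1/body2 columns
        split="train",
    ),
}
losses = {name: loss for name in train_dataset}

trainer = SentenceTransformerTrainer(
    model=model, train_dataset=train_dataset, loss=losses
)
trainer.train()
```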
### Training Hyperparameters

#### Non-Default Hyperparameters

- `per_device_train_batch_size`: 256
- `learning_rate`: 8e-05
- `weight_decay`: 1e-06
- `num_train_epochs`: 2
- `warmup_ratio`: 0.1

- `do_predict`: False
- `eval_strategy`: no
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 256
- `per_device_eval_batch_size`: 8
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 8e-05
- `weight_decay`: 1e-06
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999

- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False

- `bf16`: False
- `fp16`: True
- `fp16_opt_level`: O1

- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `parallelism_config`: None
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch

- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `project`: huggingface
- `trackio_space_id`: trackio
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False

- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: no
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False

- `use_liger_kernel`: False
- `liger_kernel_config`: None
- `eval_use_gather_object`: False
- `average_tokens_across_devices`: True
- `prompts`: None
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: proportional
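
For reference, a minimal sketch of passing the non-default hyperparameters above to `SentenceTransformerTrainingArguments` (the `output_dir` is a placeholder; all other values are taken from the list):

```python
from sentence_transformers.training_args import (
    MultiDatasetBatchSamplers,
    SentenceTransformerTrainingArguments,
)

args = SentenceTransformerTrainingArguments(
    output_dir="outputs",  # placeholder, not from this run
    per_device_train_batch_size=256,
    learning_rate=8e-5,
    weight_decay=1e-6,
    num_train_epochs=2,
    warmup_ratio=0.1,
    fp16=True,
    # Batches are drawn from each dataset in proportion to its size.
    multi_dataset_batch_sampler=MultiDatasetBatchSamplers.PROPORTIONAL,
)
```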
### Training Logs

| Epoch | Step | Training Loss |
|:------:|:-----:|:-------------:|
| 0.0202 | 500 | 4.5778 |
| 0.0404 | 1000 | 3.5556 |
| 0.0606 | 1500 | 2.5948 |
| 0.0808 | 2000 | 2.3723 |
| 0.1011 | 2500 | 2.1149 |
| 0.1213 | 3000 | 2.3977 |
| 0.1415 | 3500 | 2.3535 |
| 0.1617 | 4000 | 1.9057 |
| 0.1819 | 4500 | 2.1313 |
| 0.2021 | 5000 | 2.1719 |
| 0.2223 | 5500 | 1.887 |
| 0.2425 | 6000 | 2.1792 |
| 0.2627 | 6500 | 2.3001 |
| 0.2830 | 7000 | 2.0002 |
| 0.3032 | 7500 | 1.9358 |
| 0.3234 | 8000 | 1.9074 |
| 0.3436 | 8500 | 1.9204 |
| 0.3638 | 9000 | 1.8991 |
| 0.3840 | 9500 | 2.0086 |
| 0.4042 | 10000 | 1.8229 |
| 0.4244 | 10500 | 1.7437 |
| 0.4446 | 11000 | 2.2012 |
| 0.4649 | 11500 | 1.6898 |
| 0.4851 | 12000 | 2.1212 |
| 0.5053 | 12500 | 1.8014 |
| 0.5255 | 13000 | 2.1112 |
| 0.5457 | 13500 | 1.885 |
| 0.5659 | 14000 | 1.6889 |
| 0.5861 | 14500 | 1.6377 |
| 0.6063 | 15000 | 1.8526 |
| 0.6265 | 15500 | 1.8912 |
| 0.6468 | 16000 | 1.8621 |
| 0.6670 | 16500 | 1.743 |
| 0.6872 | 17000 | 1.5893 |
| 0.7074 | 17500 | 1.9079 |
| 0.7276 | 18000 | 1.5885 |
| 0.7478 | 18500 | 1.9128 |
| 0.7680 | 19000 | 1.6654 |
| 0.7882 | 19500 | 1.7099 |
| 0.8084 | 20000 | 1.4688 |
| 0.8287 | 20500 | 1.3844 |
| 0.8489 | 21000 | 1.7908 |
| 0.8691 | 21500 | 1.7075 |
| 0.8893 | 22000 | 1.8114 |
| 0.9095 | 22500 | 1.5198 |
| 0.9297 | 23000 | 1.8605 |
| 0.9499 | 23500 | 1.6604 |
| 0.9701 | 24000 | 1.5891 |
| 0.9903 | 24500 | 1.5906 |
| 1.0106 | 25000 | 1.5027 |
| 1.0308 | 25500 | 1.7599 |
| 1.0510 | 26000 | 1.4124 |
| 1.0712 | 26500 | 1.5636 |
| 1.0914 | 27000 | 1.6126 |
| 1.1116 | 27500 | 1.4625 |
| 1.1318 | 28000 | 1.4467 |
| 1.1520 | 28500 | 1.6898 |
| 1.1722 | 29000 | 1.5088 |
| 1.1924 | 29500 | 1.5158 |
| 1.2127 | 30000 | 1.5266 |
| 1.2329 | 30500 | 1.465 |
| 1.2531 | 31000 | 1.5687 |
| 1.2733 | 31500 | 1.4397 |
| 1.2935 | 32000 | 1.7929 |
| 1.3137 | 32500 | 1.5893 |
| 1.3339 | 33000 | 1.4727 |
| 1.3541 | 33500 | 1.6007 |
| 1.3743 | 34000 | 1.2833 |
| 1.3946 | 34500 | 1.5541 |
| 1.4148 | 35000 | 1.3354 |
| 1.4350 | 35500 | 1.4509 |
| 1.4552 | 36000 | 1.6065 |
| 1.4754 | 36500 | 1.6393 |
| 1.4956 | 37000 | 1.3914 |
| 1.5158 | 37500 | 1.3584 |
| 1.5360 | 38000 | 1.5504 |
| 1.5562 | 38500 | 1.2169 |
| 1.5765 | 39000 | 1.4081 |
| 1.5967 | 39500 | 1.5506 |
| 1.6169 | 40000 | 1.473 |
| 1.6371 | 40500 | 1.2517 |
| 1.6573 | 41000 | 1.7644 |
| 1.6775 | 41500 | 1.4237 |
| 1.6977 | 42000 | 1.295 |
| 1.7179 | 42500 | 1.4951 |
| 1.7381 | 43000 | 1.4389 |
| 1.7584 | 43500 | 1.5742 |
| 1.7786 | 44000 | 1.4843 |
| 1.7988 | 44500 | 1.4806 |
| 1.8190 | 45000 | 1.3674 |
| 1.8392 | 45500 | 1.329 |
| 1.8594 | 46000 | 1.7644 |
| 1.8796 | 46500 | 1.36 |
| 1.8998 | 47000 | 1.2003 |
| 1.9200 | 47500 | 1.233 |
| 1.9403 | 48000 | 1.5147 |
| 1.9605 | 48500 | 1.3838 |
| 1.9807 | 49000 | 1.4928 |
### Framework Versions
- Python: 3.12.10
- Sentence Transformers: 5.1.2
- Transformers: 4.57.3
- PyTorch: 2.7.1+cu126
- Accelerate: 1.7.0
- Datasets: 3.6.0
- Tokenizers: 0.22.1

## Citation
config.json
CHANGED
@@ -13,6 +13,7 @@
   "cls_token_id": 50281,
   "decoder_bias": true,
   "deterministic_flash_attn": false,
+  "dtype": "float32",
   "embedding_dropout": 0.0,
   "eos_token_id": 50282,
   "global_attn_every_n_layers": 3,
@@ -41,7 +42,6 @@
   "sep_token_id": 50282,
   "sparse_pred_ignore_index": -100,
   "sparse_prediction": false,
-  "torch_dtype": "float32",
-  "transformers_version": "4.53.2",
+  "transformers_version": "4.57.3",
   "vocab_size": 50368
 }
|
config_sentence_transformers.json
CHANGED
@@ -2,7 +2,7 @@
   "model_type": "SentenceTransformer",
   "__version__": {
     "sentence_transformers": "5.1.2",
-    "transformers": "4.53.2",
+    "transformers": "4.57.3",
     "pytorch": "2.7.1+cu126"
   },
   "prompts": {
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:895b49d6283aa8bc1a1bcf30e93046f410c8c32d946f0ee02e688c55f602024c
 size 127538496
sentence_bert_config.json
CHANGED
@@ -1,4 +1,4 @@
 {
-  "max_seq_length":
+  "max_seq_length": 1024,
   "do_lower_case": false
 }
tokenizer.json
CHANGED
@@ -2,7 +2,7 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length":
+    "max_length": 1024,
     "strategy": "LongestFirst",
     "stride": 0
   },
tokenizer_config.json
CHANGED
@@ -937,7 +937,7 @@
     "input_ids",
     "attention_mask"
   ],
-  "model_max_length":
+  "model_max_length": 1024,
   "pad_token": "[PAD]",
   "sep_token": "[SEP]",
   "tokenizer_class": "PreTrainedTokenizerFast",