Add new SentenceTransformer model
- README.md (+126 -129)
- model.safetensors (+1 -1)
README.md
CHANGED
@@ -37,27 +37,30 @@ widget:
 \ pediatrician, or paediatrician. The word pediatrics and its cognates mean healer\
 \ of children; they derive from two Greek words: παῖς (pais child)\
 \ and ἰατρός (iatros doctor, healer)."
- - source_sentence:
-
+ - source_sentence: However , in 1919 , concluded that no more operational awards would
+   be made for the recently decreed war .
 sentences:
- -
-
-
-
-
-
- - source_sentence: A man is riding on one wheel on a motorcycle.
+ - At executive level , EEAA represents the central arm of the ministry .
+ - In 1919 , however , no operational awards would be made for the recently concluded
+   war .
+ - He was asked his opinion about the books `` Mission to Moscow '' by Joseph E.
+   Davies and `` One World '' by Wendell Willkie .
+ - source_sentence: Twelve killed in bomb blast on Pakistani train
 sentences:
- -
- -
- -
- -
+ - Five killed by bomb blast in East India
+ - Five million citizens get unofficial salary in Ukraine
+ - Above that, seniors would be responsible for 100 percent of drug costs until the
+   out-of-pocket total reaches $3,600.
+ - source_sentence: Pen Hadow, who became the first person to reach the geographic
+   North Pole unsupported from Canada, has just over two days of rations left.
 sentences:
- -
-
-
-
-
+ - Remnants of highly enriched uranium were found near an Iranian nuclear facility
+   by United Nations inspectors, deepening fears that Iran possibly has a secret
+   nuclear weapons program.
+ - However, the singer believes that artists similar to his self should not receive
+   any blame.
+ - Pen Hadow, the first person to to reach the North Pole, has only a little more
+   than two days of rations left.
 - source_sentence: what are the three subatomic particles called?
 sentences:
 - Subatomic particles include electrons, the negatively charged, almost massless
@@ -168,7 +171,7 @@ print(query_embeddings.shape, document_embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(query_embeddings, document_embeddings)
 print(similarities)
- # tensor([[
+ # tensor([[0.6104, 0.0070, 0.0514]])
 ```

 <!--
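The hunk above fills in the example output: `model.similarity(query_embeddings, document_embeddings)` returns one row per query and one column per document, which is why one query scored against three documents prints a 1×3 tensor. A minimal sketch of the full flow, assuming a placeholder repository id (the commit does not show the final model name) and plain `encode` calls:

```python
from sentence_transformers import SentenceTransformer

# Placeholder id; the final repository name is not shown in this commit.
model = SentenceTransformer("user/model-name")

query = "what are the three subatomic particles called?"
documents = [
    "Twelve killed in bomb blast on Pakistani train",
    "Five killed by bomb blast in East India",
    "Five million citizens get unofficial salary in Ukraine",
]

# encode() returns one embedding per input text.
query_embeddings = model.encode([query])       # shape: (1, dim)
document_embeddings = model.encode(documents)  # shape: (3, dim)

# similarity() computes a (num_queries, num_documents) score matrix.
similarities = model.similarity(query_embeddings, document_embeddings)
print(similarities.shape)  # torch.Size([1, 3])
```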
@@ -221,13 +224,13 @@ You can finetune this model on your own dataset.
 | | sentence1 | sentence2 | label |
 |:--------|:-----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:------------------------------------------------|
 | type | string | string | int |
- | details | <ul><li>min:
+ | details | <ul><li>min: 10 tokens</li><li>mean: 27.74 tokens</li><li>max: 54 tokens</li></ul> | <ul><li>min: 10 tokens</li><li>mean: 27.73 tokens</li><li>max: 55 tokens</li></ul> | <ul><li>0: ~54.60%</li><li>1: ~45.40%</li></ul> |
 * Samples:
- | sentence1
-
- | <code>
- | <code>
- | <code>
+ | sentence1 | sentence2 | label |
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
+ | <code>Göttsche received international acclaim with his formula for the generating function for the Hilbert numbers of the Betti scheme of points on an algebraic surface :</code> | <code>With his formula for the producing function for the Betti - numbers of the Hilbert scheme of points on an algebraic surface , Göttsche received international recognition :</code> | <code>0</code> |
+ | <code>The former AFL players Tarkyn Lockyer ( Collingwood ) and Ryan Brabazon ( Sydney ) , Jason Mandzij ( Gold Coast ) , started their football careers and played for the Kangas .</code> | <code>Former AFL players Ryan Brabazon ( Collingwood ) and Tarkyn Lockyer ( Sydney ) , Jason Mandzij ( Gold Coast ) started their football careers playing for the Kangas .</code> | <code>0</code> |
+ | <code>Potter married in 1945 . He and his wife Anne ( a weaver ) had two children , Julian ( born 1947 ) and Mary ( born 1952 ) .</code> | <code>He and his wife Anne ( a weaver ) had two children , Julian ( born 1947 ) and Mary ( born in 1952 ) .</code> | <code>0</code> |
 * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
 ```json
 {
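The dataset hunks above and below pair two sentence columns with an integer label and train with AnglELoss. A rough sketch of how such a pair dataset and loss are wired up in sentence-transformers (the base model is a placeholder; the sample texts come from this card's own tables):

```python
from datasets import Dataset
from sentence_transformers import SentenceTransformer
from sentence_transformers.losses import AnglELoss

model = SentenceTransformer("bert-base-uncased")  # placeholder base model

# Columns mirror the statistics tables: sentence1, sentence2, and an int label.
train_dataset = Dataset.from_dict({
    "sentence1": ["Twelve killed in bomb blast on Pakistani train"],
    "sentence2": ["Five killed by bomb blast in East India"],
    "label": [0],
})

# AnglELoss is a CoSENT-style pairwise objective using an angle-based
# similarity; scale=20.0 is the library default.
loss = AnglELoss(model, scale=20.0)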
@@ -244,16 +247,16 @@ You can finetune this model on your own dataset.
 * Size: 11,004 training samples
 * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
- | | sentence1 | sentence2
-
- | type | string | string
- | details | <ul><li>min:
+ | | sentence1 | sentence2 | label |
+ |:--------|:-----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:------------------------------------------------|
+ | type | string | string | int |
+ | details | <ul><li>min: 10 tokens</li><li>mean: 27.33 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>min: 12 tokens</li><li>mean: 27.3 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>0: ~32.40%</li><li>1: ~67.60%</li></ul> |
 * Samples:
- | sentence1
-
- | <code>
- | <code>
- | <code>
+ | sentence1 | sentence2 | label |
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
+ | <code>Passed in 1999 but never put into effect , the law would have made it illegal for bar and restaurant patrons to light up .</code> | <code>Passed in 1999 but never put into effect , the smoking law would have prevented bar and restaurant patrons from lighting up , but exempted private clubs from the regulation .</code> | <code>0</code> |
+ | <code>" Indeed , Iran should be put on notice that efforts to try to remake Iraq in their image will be aggressively put down , " he said .</code> | <code>" Iran should be on notice that attempts to remake Iraq in Iran 's image will be aggressively put down , " he said .</code> | <code>1</code> |
+ | <code>But U.S. troops will not shrink from mounting raids and attacking their foes when their locations can be pinpointed .</code> | <code>But American troops will not shrink from mounting raids in the locations of their foes that can be pinpointed .</code> | <code>1</code> |
 * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
 ```json
 {
@@ -273,13 +276,13 @@ You can finetune this model on your own dataset.
 | | sentence1 | sentence2 | label |
 |:--------|:----------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|:------------------------------------------------|
 | type | string | string | int |
- | details | <ul><li>min:
+ | details | <ul><li>min: 6 tokens</li><li>mean: 13.83 tokens</li><li>max: 33 tokens</li></ul> | <ul><li>min: 29 tokens</li><li>mean: 340.09 tokens</li><li>max: 1024 tokens</li></ul> | <ul><li>0: ~31.40%</li><li>1: ~68.60%</li></ul> |
 * Samples:
- | sentence1
-
- | <code>
- | <code>
- | <code>
+ | sentence1 | sentence2 | label |
+ |:----------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
+ | <code>Performance (film) is a religion.</code> | <code>The associative model of data is a data model for database systems .. data model. data model. database. database. Other data models , such as the relational model and the object data model , are record-based .. data model. data model. relational model. relational model. These models involve encompassing attributes about a thing , such as a car , in a record structure .. Such attributes might be registration , colour , make , model , etc. .. In the associative model , everything which has `` discrete independent existence '' is modeled as an entity , and relationships between them are modeled as associations .. The granularity at which data is represented is similar to schemes presented by Chen -LRB- Entity-relationship model -RRB- ; Bracchi , Paolini and Pelagatti -LRB- Binary Relations -RRB- ; and Senko -LRB- The Entity Set Model -RRB- .. Entity-relationship model. Entity-relationship model. A number of claims made about the model by Simon Williams , in his book The Associative Model ...</code> | <code>1</code> |
+ | <code>American Gods (TV series) has one showrunner, whose name is Greg Berlanti.</code> | <code>American Gods is an American television series based on the novel of the same name , written by Neil Gaiman and originally published in 2001 .. American Gods. American Gods. Neil Gaiman. Neil Gaiman. novel of the same name. American Gods. The television series was developed by Bryan Fuller and Michael Green for the premium cable network Starz .. Bryan Fuller. Bryan Fuller. Michael Green. Michael Green ( writer ). Starz. Starz. Fuller and Green are the showrunners for the series .. Gaiman serves as an executive producer along with Fuller , Green , Craig Cegielski , Stefanie Berk , and Thom Beers .. Thom Beers. Thom Beers. The first episode premiered on the Starz network and through their streaming application on April 30 , 2017 .. Starz. Starz. In May 2017 , the series was renewed for a second season .</code> | <code>0</code> |
+ | <code>The Ren & Stimpy Show was one of the original four Nicktoons.</code> | <code>Cloud was a browser-based operating system created by Good OS LLC , a Los Angeles-based corporation .. Los Angeles. Los Angeles. The company initially launched a Linux distribution called gOS which is heavily based on Ubuntu , now in its third incarnation .. gOS. gOS ( operating system ). Ubuntu. Ubuntu ( operating system )</code> | <code>1</code> |
 * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
 ```json
 {
@@ -299,13 +302,13 @@ You can finetune this model on your own dataset.
 | | sentence1 | sentence2 | label |
 |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:------------------------------------------------|
 | type | string | string | int |
- | details | <ul><li>min: 6 tokens</li><li>mean: 22.
+ | details | <ul><li>min: 6 tokens</li><li>mean: 22.32 tokens</li><li>max: 61 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 22.15 tokens</li><li>max: 46 tokens</li></ul> | <ul><li>0: ~54.50%</li><li>1: ~45.50%</li></ul> |
 * Samples:
- | sentence1
-
- | <code>
- | <code>
- | <code>
+ | sentence1 | sentence2 | label |
+ |:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------|:---------------|
+ | <code>the process of shrinking the size of a file by removing data or recoding it more efficiently</code> | <code>reducing the amount of space needed to store a piece of data/bandwidth to transmit it (ex. zip files)</code> | <code>0</code> |
+ | <code>the siem software can ensure that the time is the same across devices so the security events across devices are recorded at the same time.</code> | <code>feature of a siem that makes sure all products are synced up so they are running with the same timestamps.</code> | <code>1</code> |
+ | <code>a model that is part of a dssa to describe the context and domain semantics important to understand a reference architecture and its architectural decisions</code> | <code>provide a means of information about that class of system and of comparing different architectures</code> | <code>0</code> |
 * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
 ```json
 {
@@ -322,16 +325,16 @@ You can finetune this model on your own dataset.
 * Size: 10,047 training samples
 * Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>label</code>
 * Approximate statistics based on the first 1000 samples:
- | | sentence1 | sentence2
-
- | type | string | string
- | details | <ul><li>min: 4 tokens</li><li>mean: 17.
+ | | sentence1 | sentence2 | label |
+ |:--------|:-----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:------------------------------------------------|
+ | type | string | string | int |
+ | details | <ul><li>min: 4 tokens</li><li>mean: 17.82 tokens</li><li>max: 124 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 17.3 tokens</li><li>max: 143 tokens</li></ul> | <ul><li>0: ~37.20%</li><li>1: ~62.80%</li></ul> |
 * Samples:
- | sentence1
-
- | <code>
- | <code>
- | <code>
+ | sentence1 | sentence2 | label |
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------|
+ | <code>"Kahuku Ranch has world - class qualities - tremendous resources, tremendous beauty and tremendous value to global biodiversity."</code> | <code>"TRENENDOUS BEAUTY AND TREMENDOUS VALUE TO GLOBAL BIODERVERSITY TRENENDOUS RESOURCES-CLASS QUALITIES KAHUKU RANCH HAS WORLD"</code> | <code>1</code> |
+ | <code>In Damascus, Syrian Information Minister Ahmad al-Hassan called the charges "baseless and illogical".</code> | <code>The Syrian Information Minister Ahmad al-Hassan, in Damascus, termed the charges without base and with no logic behind</code> | <code>1</code> |
+ | <code>We'd talk about the stars... ...and whether there might be somebody else like us out in space,... ...places we wanted to go and... it made our trials seem smaller.</code> | <code>We often would talk about the stars and if somebody else is similar to us out in the universe, places we wanted to visit and it made our problems seem minuscule.</code> | <code>1</code> |
 * Loss: [<code>AnglELoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#angleloss) with these parameters:
 ```json
 {
@@ -351,13 +354,13 @@ You can finetune this model on your own dataset.
 | | sentence1 | sentence2 | label |
 |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
 | type | string | string | float |
- | details | <ul><li>min: 6 tokens</li><li>mean:
+ | details | <ul><li>min: 6 tokens</li><li>mean: 15.23 tokens</li><li>max: 50 tokens</li></ul> | <ul><li>min: 6 tokens</li><li>mean: 15.39 tokens</li><li>max: 51 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 2.73</li><li>max: 5.0</li></ul> |
 * Samples:
- | sentence1
-
- | <code>
- | <code>
- | <code>The
+ | sentence1 | sentence2 | label |
+ |:--------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------|:--------------------------------|
+ | <code>China's anger at N. Korea overcomes worry over US</code> | <code>China's anger at North Korea overcomes worry over U.S. stealth flights</code> | <code>3.200000047683716</code> |
+ | <code>Declining issues outnumbered advancers nearly 2 to 1 on the New York Stock Exchange.</code> | <code>Advancers outnumbered decliners by nearly 8 to 3 on the NYSE and more than 11 to 5 on Nasdaq.</code> | <code>1.7999999523162842</code> |
+ | <code>The computers were reportedly located in the U.S., Canada and South Korea.</code> | <code>The PCs are scattered across the United States, Canada and South Korea.</code> | <code>4.75</code> |
 * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
 ```json
 {
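From here the splits switch to CoSENTLoss, whose labels are float similarity scores (roughly 0-5 in the samples) rather than binary ints. A minimal sketch under the same assumptions as the earlier snippet:

```python
from datasets import Dataset
from sentence_transformers.losses import CoSENTLoss

# Float labels, as in the samples above (e.g. 4.75 for near-paraphrases).
train_dataset = Dataset.from_dict({
    "sentence1": ["The computers were reportedly located in the U.S., Canada and South Korea."],
    "sentence2": ["The PCs are scattered across the United States, Canada and South Korea."],
    "label": [4.75],
})

# CoSENTLoss ranks pairs so that higher-labeled pairs receive higher cosine
# similarity; scale=20.0 is the library default. `model` is the
# SentenceTransformer from the previous sketch.
loss = CoSENTLoss(model, scale=20.0)
```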
@@ -377,13 +380,13 @@ You can finetune this model on your own dataset.
 | | sentence1 | sentence2 | label |
 |:--------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:---------------------------------------------------------------|
 | type | string | string | float |
- | details | <ul><li>min: 6 tokens</li><li>mean: 12.
+ | details | <ul><li>min: 6 tokens</li><li>mean: 12.39 tokens</li><li>max: 30 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 12.16 tokens</li><li>max: 38 tokens</li></ul> | <ul><li>min: 1.0</li><li>mean: 3.48</li><li>max: 5.0</li></ul> |
 * Samples:
- | sentence1
-
- | <code>
- | <code>
- | <code>A
+ | sentence1 | sentence2 | label |
+ |:-------------------------------------------------------------------|:------------------------------------------------------------|:-------------------------------|
+ | <code>Someone is cutting some paper with scissors</code> | <code>The piece of paper is being cut</code> | <code>4.5</code> |
+ | <code>A man is hanging up the phone</code> | <code>A man is making a phone call</code> | <code>3.799999952316284</code> |
+ | <code>A person is pouring olive oil into a pot on the stove</code> | <code>A person is pouring oil for cooking into a pot</code> | <code>4.300000190734863</code> |
 * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
 ```json
 {
@@ -400,16 +403,16 @@ You can finetune this model on your own dataset.
 * Size: 14,280 training samples
 * Columns: <code>label</code>, <code>sentence1</code>, and <code>sentence2</code>
 * Approximate statistics based on the first 1000 samples:
- | | label | sentence1 | sentence2
-
- | type | float | string | string
- | details | <ul><li>min: 0.0</li><li>mean: 3.
+ | | label | sentence1 | sentence2 |
+ |:--------|:---------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
+ | type | float | string | string |
+ | details | <ul><li>min: 0.0</li><li>mean: 3.16</li><li>max: 5.0</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 19.29 tokens</li><li>max: 80 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 17.45 tokens</li><li>max: 82 tokens</li></ul> |
 * Samples:
- | label
-
- | <code>
- | <code>
- | <code>
+ | label | sentence1 | sentence2 |
+ |:------------------|:------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------|
+ | <code>1.0</code> | <code>How do I wire a bathroom exhaust fan/light to two switches?</code> | <code>How do I wire a combo with two supplies?</code> |
+ | <code>4.2</code> | <code>How an all-American hero fell to earth - . (Where have all the REAL heroes gone?)</code> | <code>How all-American hero fell to earth</code> |
+ | <code>3.75</code> | <code>Be larger in number, quantity, power, status, or importance, without personally having sovereign power.</code> | <code>be larger in number, quantity, power, status or importance.</code> |
 * Loss: [<code>CoSENTLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosentloss) with these parameters:
 ```json
 {
@@ -728,15 +731,11 @@ You can finetune this model on your own dataset.
 ### Training Hyperparameters
 #### Non-Default Hyperparameters

- - `per_device_train_batch_size`:
- - `learning_rate`:
- - `weight_decay`:
+ - `per_device_train_batch_size`: 384
+ - `learning_rate`: 1.0
+ - `weight_decay`: 6e-05
 - `num_train_epochs`: 1
- - `warmup_ratio`: 0.03
 - `fp16`: True
- - `gradient_checkpointing`: True
- - `torch_compile`: True
- - `torch_compile_backend`: inductor

 #### All Hyperparameters
 <details><summary>Click to expand</summary>
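The non-default hyperparameters above map directly onto `SentenceTransformerTrainingArguments`; the settings this commit removes (warmup ratio, gradient checkpointing, torch compile) simply fall back to their defaults. A sketch with the values as listed, where `output_dir` is a placeholder:

```python
from sentence_transformers import SentenceTransformerTrainingArguments

args = SentenceTransformerTrainingArguments(
    output_dir="output",              # placeholder; not shown in the diff
    per_device_train_batch_size=384,
    learning_rate=1.0,                # value exactly as listed in the updated card
    weight_decay=6e-05,
    num_train_epochs=1,
    fp16=True,
)
```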
@@ -745,15 +744,15 @@ You can finetune this model on your own dataset.
 - `do_predict`: False
 - `eval_strategy`: no
 - `prediction_loss_only`: True
- - `per_device_train_batch_size`:
+ - `per_device_train_batch_size`: 384
 - `per_device_eval_batch_size`: 8
 - `per_gpu_train_batch_size`: None
 - `per_gpu_eval_batch_size`: None
 - `gradient_accumulation_steps`: 1
 - `eval_accumulation_steps`: None
 - `torch_empty_cache_steps`: None
- - `learning_rate`:
- - `weight_decay`:
+ - `learning_rate`: 1.0
+ - `weight_decay`: 6e-05
 - `adam_beta1`: 0.9
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
@@ -762,7 +761,7 @@ You can finetune this model on your own dataset.
 - `max_steps`: -1
 - `lr_scheduler_type`: linear
 - `lr_scheduler_kwargs`: {}
- - `warmup_ratio`: 0.
+ - `warmup_ratio`: 0.0
 - `warmup_steps`: 0
 - `log_level`: passive
 - `log_level_replica`: warning
@@ -828,7 +827,7 @@ You can finetune this model on your own dataset.
 - `hub_private_repo`: None
 - `hub_always_push`: False
 - `hub_revision`: None
- - `gradient_checkpointing`:
+ - `gradient_checkpointing`: False
 - `gradient_checkpointing_kwargs`: None
 - `include_inputs_for_metrics`: False
 - `include_for_metrics`: []
@@ -842,8 +841,8 @@ You can finetune this model on your own dataset.
 - `torchdynamo`: None
 - `ray_scope`: last
 - `ddp_timeout`: 1800
- - `torch_compile`:
- - `torch_compile_backend`:
+ - `torch_compile`: False
+ - `torch_compile_backend`: None
 - `torch_compile_mode`: None
 - `include_tokens_per_second`: False
 - `include_num_input_tokens_seen`: no
@@ -866,45 +865,43 @@ You can finetune this model on your own dataset.
 ### Training Logs
 | Epoch | Step | Training Loss |
 |:------:|:-----:|:-------------:|
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.
- | 0.9528 | 19000 | 1.9995 |
- | 0.9778 | 19500 | 1.8916 |
+ | 0.0267 | 500 | 4.3558 |
+ | 0.0535 | 1000 | 3.0724 |
+ | 0.0802 | 1500 | 2.979 |
+ | 0.1070 | 2000 | 2.9205 |
+ | 0.1337 | 2500 | 3.0679 |
+ | 0.1604 | 3000 | 2.837 |
+ | 0.1872 | 3500 | 3.2635 |
+ | 0.2139 | 4000 | 2.7602 |
+ | 0.2407 | 4500 | 2.6911 |
+ | 0.2674 | 5000 | 2.6963 |
+ | 0.2941 | 5500 | 2.8504 |
+ | 0.3209 | 6000 | 2.7501 |
+ | 0.3476 | 6500 | 2.6315 |
+ | 0.3744 | 7000 | 2.5372 |
+ | 0.4011 | 7500 | 2.8814 |
+ | 0.4278 | 8000 | 2.2826 |
+ | 0.4546 | 8500 | 2.764 |
+ | 0.4813 | 9000 | 2.4418 |
+ | 0.5080 | 9500 | 2.3762 |
+ | 0.5348 | 10000 | 2.5542 |
+ | 0.5615 | 10500 | 2.2653 |
+ | 0.5883 | 11000 | 2.5098 |
+ | 0.6150 | 11500 | 2.3009 |
+ | 0.6417 | 12000 | 2.4029 |
+ | 0.6685 | 12500 | 2.1538 |
+ | 0.6952 | 13000 | 2.6398 |
+ | 0.7220 | 13500 | 2.3101 |
+ | 0.7487 | 14000 | 2.8489 |
+ | 0.7754 | 14500 | 2.3822 |
+ | 0.8022 | 15000 | 2.3035 |
+ | 0.8289 | 15500 | 2.4212 |
+ | 0.8557 | 16000 | 2.1447 |
+ | 0.8824 | 16500 | 1.985 |
+ | 0.9091 | 17000 | 2.1427 |
+ | 0.9359 | 17500 | 2.3002 |
+ | 0.9626 | 18000 | 2.2671 |
+ | 0.9894 | 18500 | 2.3033 |


 ### Framework Versions
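Tying the sketches together: the card trains on several datasets at once, which `SentenceTransformerTrainer` supports by accepting dicts of datasets and losses keyed by dataset name (the `"pairs"` key below is hypothetical). Training this way produces step/loss logs like the table above:

```python
from sentence_transformers import SentenceTransformerTrainer

trainer = SentenceTransformerTrainer(
    model=model,                             # from the earlier sketches
    args=args,
    train_dataset={"pairs": train_dataset},  # hypothetical dataset name
    loss={"pairs": loss},                    # per-dataset loss mapping
)
trainer.train()
```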
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
- oid sha256:
+ oid sha256:e77447660a82d5a1c32834edf181ece19138af5fe0d1a489194f8674d0a79f19
 size 127538496
|