Upload folder using huggingface_hub
Browse files- README.md +313 -0
- config.json +35 -0
- model.safetensors +3 -0
- special_tokens_map.json +37 -0
- tokenizer.json +0 -0
- tokenizer_config.json +58 -0
- vocab.txt +0 -0
README.md
ADDED
|
@@ -0,0 +1,313 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
tags:
|
| 3 |
+
- sentence-transformers
|
| 4 |
+
- cross-encoder
|
| 5 |
+
- reranker
|
| 6 |
+
- generated_from_trainer
|
| 7 |
+
- dataset_size:45
|
| 8 |
+
- loss:BinaryCrossEntropyLoss
|
| 9 |
+
base_model: cross-encoder/ms-marco-MiniLM-L6-v2
|
| 10 |
+
pipeline_tag: text-ranking
|
| 11 |
+
library_name: sentence-transformers
|
| 12 |
+
---
|
| 13 |
+
|
| 14 |
+
# CrossEncoder based on cross-encoder/ms-marco-MiniLM-L6-v2
|
| 15 |
+
|
| 16 |
+
This is a [Cross Encoder](https://www.sbert.net/docs/cross_encoder/usage/usage.html) model finetuned from [cross-encoder/ms-marco-MiniLM-L6-v2](https://huggingface.co/cross-encoder/ms-marco-MiniLM-L6-v2) using the [sentence-transformers](https://www.SBERT.net) library. It computes scores for pairs of texts, which can be used for text reranking and semantic search.
|
| 17 |
+
|
| 18 |
+
## Model Details
|
| 19 |
+
|
| 20 |
+
### Model Description
|
| 21 |
+
- **Model Type:** Cross Encoder
|
| 22 |
+
- **Base model:** [cross-encoder/ms-marco-MiniLM-L6-v2](https://huggingface.co/cross-encoder/ms-marco-MiniLM-L6-v2) <!-- at revision c5ee24cb16019beea0893ab7796b1df96625c6b8 -->
|
| 23 |
+
- **Maximum Sequence Length:** 512 tokens
|
| 24 |
+
- **Number of Output Labels:** 1 label
|
| 25 |
+
<!-- - **Training Dataset:** Unknown -->
|
| 26 |
+
<!-- - **Language:** Unknown -->
|
| 27 |
+
<!-- - **License:** Unknown -->
|
| 28 |
+
|
| 29 |
+
### Model Sources
|
| 30 |
+
|
| 31 |
+
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
|
| 32 |
+
- **Documentation:** [Cross Encoder Documentation](https://www.sbert.net/docs/cross_encoder/usage/usage.html)
|
| 33 |
+
- **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
|
| 34 |
+
- **Hugging Face:** [Cross Encoders on Hugging Face](https://huggingface.co/models?library=sentence-transformers&other=cross-encoder)
|
| 35 |
+
|
| 36 |
+
## Usage
|
| 37 |
+
|
| 38 |
+
### Direct Usage (Sentence Transformers)
|
| 39 |
+
|
| 40 |
+
First install the Sentence Transformers library:
|
| 41 |
+
|
| 42 |
+
```bash
|
| 43 |
+
pip install -U sentence-transformers
|
| 44 |
+
```
|
| 45 |
+
|
| 46 |
+
Then you can load this model and run inference.
|
| 47 |
+
```python
|
| 48 |
+
from sentence_transformers import CrossEncoder
|
| 49 |
+
|
| 50 |
+
# Download from the 🤗 Hub
|
| 51 |
+
model = CrossEncoder("cross_encoder_model_id")
|
| 52 |
+
# Get scores for pairs of texts
|
| 53 |
+
pairs = [
|
| 54 |
+
['Kalkulus I, Fisika Dasar I, Kimia Dasar I, Pemrograman, Praktikum Pemrograman, Pengantar Logika Matematika dan Himpunan, Statistika, Kalkulus II, Geometri Analitik, Aljabar Linear Elementer, Matematika Diskrit, Pengantar Struktur Aljabar I, Kalkulus Multivariabel I, Pengantar Struktur Aljabar II, Persamaan Diferensial Elementer, Pengantar Statistika Matematika I, Pengantar Analisis Numerik, Kalkulus Lanjut, Program Linear, Kalkulus Multivariabel II, Fungsi Variabel Kompleks I, Pengantar Analisis I, Aljabar Linear, Matematika Komputasi, Pengantar Persamaan Diferensial Parsial, Fungsi Variabel Kompleks II, Pengantar Analisis II, Pengantar Model Matematika, Pengantar Proses Stokastik, Pengantar Teori Bilangan, Geometri Transformasi, Geometri, Geometri di Ruang Euclide berdimensi-n, Kalkulus Variasi, Pengantar Topologi, Pengantar Teori Ukuran & Integral Lebesgue, Kalkulus Stokastik, Pengantar Teori Persamaan Diferensial, Pengantar Ruang Riesz, Pengantar Geometri Diferensial, Pengantar Analisis Fungsional, Kapita Selekta Analisis, Teori Himpunan, Aljabar Linear Terapan I, Pengantar Teori Graf, Pengantar Teori Partisi, Teori Grup Hingga, Pengantar Kombinatorik, Pengantar Teori Pengkodean, Pengantar Teori Semigrup, Aljabar Linear Terapan II, Pengantar Kriptografi, Pengantar Teori Modul, Kapita Selekta Aljabar, Aljabar Linear Numerik, Riset Operasi A, Pengantar Teori Permainan, Sistem Dinamik ♥, Riset Operasi B, Pengantar Teori Sistem ♥, Pengantar Persamaan Diferensial Stokastik, Pengantar Masalah Syarat Batas, Pengantar Teori Kendali, Matematika Biologi, Pengantar Matematika Machine Learning, Kapita Selekta Matematika Terapan A, Kapita Selekta Matematika Terapan B, Komputasi Masalah Invers, Pengantar Metode Elemen Hingga, Kapita Selekta Komputasi Matematika, Pengantar Metode Elemen Batas, Komputasi Machine Learning, Pengantar Geometri Fraktal', 'Metode Statistika, Praktikum Metode Statistika, Kalkulus 1, Fisika Dasar 1, Kimia Dasar 1, Pemrograman, Praktikum Pemrograman, Kalkulus II, Eksplorasi dan Visualisasi Data, Aljabar Matriks, Matematika Diskrit dan Kombinatorik, Bahasa Inggris, Basis Data, Analisis Regresi Terapan, Probabilitas dan Proses Stokastik, Bahasa Indonesia, Metode Survey Sampel, Kalkulus Multivariabel untuk Statistika, Persamaan Diferensial Elementer, Pengantar Rancangan Percobaan, Pengantar Statistika Matematik I, Komputasi Statistika I, Kalkulus Lanjut, Statistika Multivariat Terapan, Pengantar Statistika Matematik II, Statistika Ofisial, Pengantar Data Sains, Komputasi Statistika II, Pengantar Data Mining, Kerja Praktek, Pengantar Teori Ukuran dan Probabilitas, Kewarganegaraan, Pengantar Runtun Waktu, Tugas Akhir I, Tugas Akhir II, Pengantar Teori Antrian & Simulasi, Manajemen Risiko Kuantitatif, Biostatistika dan Epidemiologi, Pengantar Response Surface, Kapita Selekta Statistik A, Pengantar Teori Keputusan, Pengantar Manajemen Investasi, Model Linear Tergeneralisasi, Pengantar Matematika Finansial I, Pengantar Matematika Aktuaria I, Program Linear, Analisis Variansi Terapan, Demografi, Pengendalian Kualitas Statistik, Analisis Data Kategorik, Metode Peramalan, Metode Statistika Nonparametrik, Analisis Data Survival, Persamaan Model Struktural, Pengantar Ekonometri, Statistical Machine Learning, Kapita Selekta Statistik B, Pengantar Matematika Finansial II, Pengantar Matematika Aktuaria II, Riset Operasi A, Pengantar Analisis Data Panel'],
|
| 55 |
+
['Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekainika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prak. Pemrograman Dasar, Algoritma dan Struktur Data, Analisis Variabel Kompleks, Aljabar Linear, Fisika Listrik dan Magnet, Persamaan Differensial, Probabilitas dan Variabel Acak, Metode Numeris, Statistika, Isyarat dan Sistem, Teknik Digital, Analisis Untai Elektrik DC, Telekomunikasi Dasar, Prakt. Sains Dasar, Medan Elektromagnetik, Elektronika Dasar, Teknik Kendali, Sistem Mikroprosesor, Teknik Pengolahan Isyarat Digital, Analisis Untai Elektrik AC, Pengukuran dan Instrumentasi, Teknik Optimisasi, Jaringan dan Komunikasi Data, Kerja Praktik, Elektronika Analog, Perancangan Sistem Kendali Modern, Mesin Listrik 1, Teknik Tenaga Listrik, Proyek Junior Teknik Elektro, Sistem Komunikasi, Mesin Listrik 2, Analisis Sistem Tenaga, Proyek Perancangan Teknik Elektro 1, Proyek Senior Teknik Elektro, Proyek Perancangan Teknik Elektro 2', 'Matematika Dasar, Aljabar Linier, Kalkulus 1, Pengenalan Teknologi Elektro dan Informatika Cerdas, Dasar Pemrograman, Matematika Diskrit, Sistem Operasi, Basisdata, Kalkulus 2, Probabilitas dan Statistik, Konsep Kecerdasan Artifisial, Struktur Data, Jaringan Komputer, Teori Graf, Keamanan Informasi, Perancangan dan Analisis Algoritma, Pemrograman Berorientasi Objek, Pembelajaran Mesin, Kecerdasan Komputasional, Penambangan Data, Pemrosesan Bahasa Natural, Dasar Pengembangan dan Perancangan Perangkat Lunak, Grafika Komputer, Interaksi Manusia dan Komputer, Pemrograman Web, Deep Learning, Pengolahan Citra Digital, Temu Kembali Informasi, Pemodelan 3D, Visi Komputer, Gim Cerdas, Realitas X, Visualisasi Informasi, Komputasi Biomedik, Kecerdasan Bisnis, Robotika, Visi Komputer 3D, Perancangan dan Pengembangan Gim, Pemrograman XR'],
|
| 56 |
+
['Pengawetan dan Aplikasi Produk Hayati (Natural Product Preservation and Application), Aplikasi Bioproses (Applied Bioprocess), Pemisahan dan Pemurnian Produk Hayati (Natural Product Separation and Purification), Proses Industri Pangan (Industrial Food Processing), Kapita Selekta di Teknik Produk Hayati dan Bioproses (Selected Topics in Natural Products Engineering and Bioprocess), Energi Tidak Terbarukan (Non-renewable Energy), Manajemen dan Konservasi Energi (Energy Management and Conservation), Energi Terbarukan (Renewable Energy), Utilisasi Energi (Energy Utilization), Kapita Selekta di Energi (Selected Topics in Energy), Produksi Bersih (Cleaner Production), Teknik Pengolahan Limbah (Waste Treatment Technology), Analisis Risiko Industri (Industrial Risk Analysis), Manajemen Keselamatan Proses dan Lingkungan (Process Safety and Environmental Management), Kapita Selekta di Keselamatan, Kesehatan dan Lingkungan (Selected Topics in Safety, Health and Environment), Pemrograman Komputer Lanjut (Advanced Computer Programming), Dinamika Fluida dalam Teknik Kimia (Fluid Dynamics in Chemical Engineering), Pemodelan Matematis Lanjut (Advanced Mathematical Modelling), Pemodelan Proses dan Sistem Dinamis (Process Modelling and System Dynamics), Perancangan Proses dengan Komputer (Computer-Aided Process Design), Kapita Selekta di Pemodelan Proses dan Komputasi (Selected Topics in Process Modelling and Computation), Teknologi Polimer (Polymer Technology), Teknologi Keramik (Ceramics Technology), Teknik Material Mutakhir (Advanced Engineering Material), Teknologi Komposit (Composite technology), Teknologi Elektrokimia (Electrochemical Technology), Pemisahan Setimbang Termodifikasi (Enhanced Equilibrium Separation), Kapita Selekta di Teknik Material dan Teknologi Mutakhir (Selected Topics in Material Engineering and Advanced Technology), Mineral Industri (Industrial Minerals), Karakterisasi Bahan Mineral (Mineral Materials Characterization), Teknologi Pengayaan Bijih Mineral (Ore Beneficiation Technology), Hidrometalurgi (Hydrometallurgy), Piro-metalurgi (Pyrometallurgy), Flotasi (Flotation), Kapita Selekta di Pemrosesan Mineral (Selected Topics in Mineral Processing), Analytical Chemistry and Instrumentation, General Chemistry, Organic Chemistry, Mathematics, Physics, Machine Element and Engineering Drawing, Physical Chemistry, Material Analysis Labwork, Materials of Construction for Chemical Engineering, Fundamental of Bioprocess, Electrical Power Engineering, Chemical Engineering Thermodynamics 1, Engineering Concept for Civilization, Prime Movers, Chemical Industrial Processes, Chemical Process Labwork, Chemical Engineering Principles, Applied Mathematics in Chemical Engineering, Material Transport and Storage, Chemical Engineering Thermodynamics 2, Engineering Economics, Transfer Processes, Particulate Processing, Numerical Methods, Computation Laboratory Work, Stage-wise Separation Processes, Mathematical modeling, Heat and Mass Transfer Operation, Heat Transfer, Chemical Reaction Engineering 1, Unit Operation Laboratory Work, Process Simulation and System Optimization, Scientific Method, Product Engineering, Water and wastewater treatment, Process Control, Research Project 1, Chemical Reaction Engineering 2, Utilization and Conservation of Natural Resources, Chemical Plant Design, Management, Entrepreneur ship, Research Project 2, Comprehensi ve Written Examination, Chemical Process Safety, Plant Design Project, Industrial Placement', 'Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekainika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prak. Pemrograman Dasar, Algoritma dan Struktur Data, Analisis Variabel Kompleks, Aljabar Linear, Fisika Listrik dan Magnet, Persamaan Differensial, Probabilitas dan Variabel Acak, Metode Numeris, Statistika, Isyarat dan Sistem, Teknik Digital, Analisis Untai Elektrik DC, Telekomunikasi Dasar, Prakt. Sains Dasar, Medan Elektromagnetik, Elektronika Dasar, Teknik Kendali, Sistem Mikroprosesor, Teknik Pengolahan Isyarat Digital, Analisis Untai Elektrik AC, Pengukuran dan Instrumentasi, Teknik Optimisasi, Jaringan dan Komunikasi Data, Kerja Praktik, Elektronika Analog, Perancangan Sistem Kendali Modern, Mesin Listrik 1, Teknik Tenaga Listrik, Proyek Junior Teknik Elektro, Sistem Komunikasi, Mesin Listrik 2, Analisis Sistem Tenaga, Proyek Perancangan Teknik Elektro 1, Proyek Senior Teknik Elektro, Proyek Perancangan Teknik Elektro 2'],
|
| 57 |
+
['Pengawetan dan Aplikasi Produk Hayati (Natural Product Preservation and Application), Aplikasi Bioproses (Applied Bioprocess), Pemisahan dan Pemurnian Produk Hayati (Natural Product Separation and Purification), Proses Industri Pangan (Industrial Food Processing), Kapita Selekta di Teknik Produk Hayati dan Bioproses (Selected Topics in Natural Products Engineering and Bioprocess), Energi Tidak Terbarukan (Non-renewable Energy), Manajemen dan Konservasi Energi (Energy Management and Conservation), Energi Terbarukan (Renewable Energy), Utilisasi Energi (Energy Utilization), Kapita Selekta di Energi (Selected Topics in Energy), Produksi Bersih (Cleaner Production), Teknik Pengolahan Limbah (Waste Treatment Technology), Analisis Risiko Industri (Industrial Risk Analysis), Manajemen Keselamatan Proses dan Lingkungan (Process Safety and Environmental Management), Kapita Selekta di Keselamatan, Kesehatan dan Lingkungan (Selected Topics in Safety, Health and Environment), Pemrograman Komputer Lanjut (Advanced Computer Programming), Dinamika Fluida dalam Teknik Kimia (Fluid Dynamics in Chemical Engineering), Pemodelan Matematis Lanjut (Advanced Mathematical Modelling), Pemodelan Proses dan Sistem Dinamis (Process Modelling and System Dynamics), Perancangan Proses dengan Komputer (Computer-Aided Process Design), Kapita Selekta di Pemodelan Proses dan Komputasi (Selected Topics in Process Modelling and Computation), Teknologi Polimer (Polymer Technology), Teknologi Keramik (Ceramics Technology), Teknik Material Mutakhir (Advanced Engineering Material), Teknologi Komposit (Composite technology), Teknologi Elektrokimia (Electrochemical Technology), Pemisahan Setimbang Termodifikasi (Enhanced Equilibrium Separation), Kapita Selekta di Teknik Material dan Teknologi Mutakhir (Selected Topics in Material Engineering and Advanced Technology), Mineral Industri (Industrial Minerals), Karakterisasi Bahan Mineral (Mineral Materials Characterization), Teknologi Pengayaan Bijih Mineral (Ore Beneficiation Technology), Hidrometalurgi (Hydrometallurgy), Piro-metalurgi (Pyrometallurgy), Flotasi (Flotation), Kapita Selekta di Pemrosesan Mineral (Selected Topics in Mineral Processing), Analytical Chemistry and Instrumentation, General Chemistry, Organic Chemistry, Mathematics, Physics, Machine Element and Engineering Drawing, Physical Chemistry, Material Analysis Labwork, Materials of Construction for Chemical Engineering, Fundamental of Bioprocess, Electrical Power Engineering, Chemical Engineering Thermodynamics 1, Engineering Concept for Civilization, Prime Movers, Chemical Industrial Processes, Chemical Process Labwork, Chemical Engineering Principles, Applied Mathematics in Chemical Engineering, Material Transport and Storage, Chemical Engineering Thermodynamics 2, Engineering Economics, Transfer Processes, Particulate Processing, Numerical Methods, Computation Laboratory Work, Stage-wise Separation Processes, Mathematical modeling, Heat and Mass Transfer Operation, Heat Transfer, Chemical Reaction Engineering 1, Unit Operation Laboratory Work, Process Simulation and System Optimization, Scientific Method, Product Engineering, Water and wastewater treatment, Process Control, Research Project 1, Chemical Reaction Engineering 2, Utilization and Conservation of Natural Resources, Chemical Plant Design, Management, Entrepreneur ship, Research Project 2, Comprehensi ve Written Examination, Chemical Process Safety, Plant Design Project, Industrial Placement', 'Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekanika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prakt. Pemrograman Dasar, Analisis Variabel Kompleks, Fisika Listrik dan Magnet, Aljabar Linear, Persamaan DIfferensial, Probabilitas dan Variabel Acak, Algoritme dan Struktur Data, Statistika, Metode Numeris, Isyarat dan Sistem, Komunikasi Data dan Komputer, Pemrograman Berorientasi Obyek, Arsitektur Komputer, Prakt. Sains Dasar, Medan Elektromagnetik, Teknologi Basis Data, Sistem Berbasis Mikroprosesor, Kecerdasan Buatan, Teknik Visualisasi Grafis, Jaringan Komputer, Teknik Pemodelan dan Simulasi, Sistem Operasi, Proyek Junior Teknologi Informasi, Kerja Praktik, Rekayasa Perangkat Lunak, Rekayasa Data, Pengembangan Aplikasi Web, Komputasi Awan, Interaksi Manusia dan Komputer, Proyek Senior Teknologi Informasi, Proyek Perancangan Teknologi Informasi 1, Keamanan Komputer, Integrasi Aplikasi dan Informasi, Proyek Perancangan Teknologi Informasi 2'],
|
| 58 |
+
['Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekanika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prakt. Pemrograman Dasar, Analisis Variabel Kompleks, Fisika Listrik dan Magnet, Aljabar Linear, Persamaan DIfferensial, Probabilitas dan Variabel Acak, Algoritme dan Struktur Data, Statistika, Metode Numeris, Isyarat dan Sistem, Komunikasi Data dan Komputer, Pemrograman Berorientasi Obyek, Arsitektur Komputer, Prakt. Sains Dasar, Medan Elektromagnetik, Teknologi Basis Data, Sistem Berbasis Mikroprosesor, Kecerdasan Buatan, Teknik Visualisasi Grafis, Jaringan Komputer, Teknik Pemodelan dan Simulasi, Sistem Operasi, Proyek Junior Teknologi Informasi, Kerja Praktik, Rekayasa Perangkat Lunak, Rekayasa Data, Pengembangan Aplikasi Web, Komputasi Awan, Interaksi Manusia dan Komputer, Proyek Senior Teknologi Informasi, Proyek Perancangan Teknologi Informasi 1, Keamanan Komputer, Integrasi Aplikasi dan Informasi, Proyek Perancangan Teknologi Informasi 2', 'Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekainika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prak. Pemrograman Dasar, Algoritma dan Struktur Data, Analisis Variabel Kompleks, Aljabar Linear, Fisika Listrik dan Magnet, Persamaan Differensial, Probabilitas dan Variabel Acak, Metode Numeris, Statistika, Isyarat dan Sistem, Teknik Digital, Analisis Untai Elektrik DC, Telekomunikasi Dasar, Prakt. Sains Dasar, Medan Elektromagnetik, Elektronika Dasar, Teknik Kendali, Sistem Mikroprosesor, Teknik Pengolahan Isyarat Digital, Analisis Untai Elektrik AC, Pengukuran dan Instrumentasi, Teknik Optimisasi, Jaringan dan Komunikasi Data, Kerja Praktik, Elektronika Analog, Perancangan Sistem Kendali Modern, Mesin Listrik 1, Teknik Tenaga Listrik, Proyek Junior Teknik Elektro, Sistem Komunikasi, Mesin Listrik 2, Analisis Sistem Tenaga, Proyek Perancangan Teknik Elektro 1, Proyek Senior Teknik Elektro, Proyek Perancangan Teknik Elektro 2'],
|
| 59 |
+
]
|
| 60 |
+
scores = model.predict(pairs)
|
| 61 |
+
print(scores.shape)
|
| 62 |
+
# (5,)
|
| 63 |
+
|
| 64 |
+
# Or rank different texts based on similarity to a single text
|
| 65 |
+
ranks = model.rank(
|
| 66 |
+
'Kalkulus I, Fisika Dasar I, Kimia Dasar I, Pemrograman, Praktikum Pemrograman, Pengantar Logika Matematika dan Himpunan, Statistika, Kalkulus II, Geometri Analitik, Aljabar Linear Elementer, Matematika Diskrit, Pengantar Struktur Aljabar I, Kalkulus Multivariabel I, Pengantar Struktur Aljabar II, Persamaan Diferensial Elementer, Pengantar Statistika Matematika I, Pengantar Analisis Numerik, Kalkulus Lanjut, Program Linear, Kalkulus Multivariabel II, Fungsi Variabel Kompleks I, Pengantar Analisis I, Aljabar Linear, Matematika Komputasi, Pengantar Persamaan Diferensial Parsial, Fungsi Variabel Kompleks II, Pengantar Analisis II, Pengantar Model Matematika, Pengantar Proses Stokastik, Pengantar Teori Bilangan, Geometri Transformasi, Geometri, Geometri di Ruang Euclide berdimensi-n, Kalkulus Variasi, Pengantar Topologi, Pengantar Teori Ukuran & Integral Lebesgue, Kalkulus Stokastik, Pengantar Teori Persamaan Diferensial, Pengantar Ruang Riesz, Pengantar Geometri Diferensial, Pengantar Analisis Fungsional, Kapita Selekta Analisis, Teori Himpunan, Aljabar Linear Terapan I, Pengantar Teori Graf, Pengantar Teori Partisi, Teori Grup Hingga, Pengantar Kombinatorik, Pengantar Teori Pengkodean, Pengantar Teori Semigrup, Aljabar Linear Terapan II, Pengantar Kriptografi, Pengantar Teori Modul, Kapita Selekta Aljabar, Aljabar Linear Numerik, Riset Operasi A, Pengantar Teori Permainan, Sistem Dinamik ♥, Riset Operasi B, Pengantar Teori Sistem ♥, Pengantar Persamaan Diferensial Stokastik, Pengantar Masalah Syarat Batas, Pengantar Teori Kendali, Matematika Biologi, Pengantar Matematika Machine Learning, Kapita Selekta Matematika Terapan A, Kapita Selekta Matematika Terapan B, Komputasi Masalah Invers, Pengantar Metode Elemen Hingga, Kapita Selekta Komputasi Matematika, Pengantar Metode Elemen Batas, Komputasi Machine Learning, Pengantar Geometri Fraktal',
|
| 67 |
+
[
|
| 68 |
+
'Metode Statistika, Praktikum Metode Statistika, Kalkulus 1, Fisika Dasar 1, Kimia Dasar 1, Pemrograman, Praktikum Pemrograman, Kalkulus II, Eksplorasi dan Visualisasi Data, Aljabar Matriks, Matematika Diskrit dan Kombinatorik, Bahasa Inggris, Basis Data, Analisis Regresi Terapan, Probabilitas dan Proses Stokastik, Bahasa Indonesia, Metode Survey Sampel, Kalkulus Multivariabel untuk Statistika, Persamaan Diferensial Elementer, Pengantar Rancangan Percobaan, Pengantar Statistika Matematik I, Komputasi Statistika I, Kalkulus Lanjut, Statistika Multivariat Terapan, Pengantar Statistika Matematik II, Statistika Ofisial, Pengantar Data Sains, Komputasi Statistika II, Pengantar Data Mining, Kerja Praktek, Pengantar Teori Ukuran dan Probabilitas, Kewarganegaraan, Pengantar Runtun Waktu, Tugas Akhir I, Tugas Akhir II, Pengantar Teori Antrian & Simulasi, Manajemen Risiko Kuantitatif, Biostatistika dan Epidemiologi, Pengantar Response Surface, Kapita Selekta Statistik A, Pengantar Teori Keputusan, Pengantar Manajemen Investasi, Model Linear Tergeneralisasi, Pengantar Matematika Finansial I, Pengantar Matematika Aktuaria I, Program Linear, Analisis Variansi Terapan, Demografi, Pengendalian Kualitas Statistik, Analisis Data Kategorik, Metode Peramalan, Metode Statistika Nonparametrik, Analisis Data Survival, Persamaan Model Struktural, Pengantar Ekonometri, Statistical Machine Learning, Kapita Selekta Statistik B, Pengantar Matematika Finansial II, Pengantar Matematika Aktuaria II, Riset Operasi A, Pengantar Analisis Data Panel',
|
| 69 |
+
'Matematika Dasar, Aljabar Linier, Kalkulus 1, Pengenalan Teknologi Elektro dan Informatika Cerdas, Dasar Pemrograman, Matematika Diskrit, Sistem Operasi, Basisdata, Kalkulus 2, Probabilitas dan Statistik, Konsep Kecerdasan Artifisial, Struktur Data, Jaringan Komputer, Teori Graf, Keamanan Informasi, Perancangan dan Analisis Algoritma, Pemrograman Berorientasi Objek, Pembelajaran Mesin, Kecerdasan Komputasional, Penambangan Data, Pemrosesan Bahasa Natural, Dasar Pengembangan dan Perancangan Perangkat Lunak, Grafika Komputer, Interaksi Manusia dan Komputer, Pemrograman Web, Deep Learning, Pengolahan Citra Digital, Temu Kembali Informasi, Pemodelan 3D, Visi Komputer, Gim Cerdas, Realitas X, Visualisasi Informasi, Komputasi Biomedik, Kecerdasan Bisnis, Robotika, Visi Komputer 3D, Perancangan dan Pengembangan Gim, Pemrograman XR',
|
| 70 |
+
'Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekainika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prak. Pemrograman Dasar, Algoritma dan Struktur Data, Analisis Variabel Kompleks, Aljabar Linear, Fisika Listrik dan Magnet, Persamaan Differensial, Probabilitas dan Variabel Acak, Metode Numeris, Statistika, Isyarat dan Sistem, Teknik Digital, Analisis Untai Elektrik DC, Telekomunikasi Dasar, Prakt. Sains Dasar, Medan Elektromagnetik, Elektronika Dasar, Teknik Kendali, Sistem Mikroprosesor, Teknik Pengolahan Isyarat Digital, Analisis Untai Elektrik AC, Pengukuran dan Instrumentasi, Teknik Optimisasi, Jaringan dan Komunikasi Data, Kerja Praktik, Elektronika Analog, Perancangan Sistem Kendali Modern, Mesin Listrik 1, Teknik Tenaga Listrik, Proyek Junior Teknik Elektro, Sistem Komunikasi, Mesin Listrik 2, Analisis Sistem Tenaga, Proyek Perancangan Teknik Elektro 1, Proyek Senior Teknik Elektro, Proyek Perancangan Teknik Elektro 2',
|
| 71 |
+
'Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekanika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prakt. Pemrograman Dasar, Analisis Variabel Kompleks, Fisika Listrik dan Magnet, Aljabar Linear, Persamaan DIfferensial, Probabilitas dan Variabel Acak, Algoritme dan Struktur Data, Statistika, Metode Numeris, Isyarat dan Sistem, Komunikasi Data dan Komputer, Pemrograman Berorientasi Obyek, Arsitektur Komputer, Prakt. Sains Dasar, Medan Elektromagnetik, Teknologi Basis Data, Sistem Berbasis Mikroprosesor, Kecerdasan Buatan, Teknik Visualisasi Grafis, Jaringan Komputer, Teknik Pemodelan dan Simulasi, Sistem Operasi, Proyek Junior Teknologi Informasi, Kerja Praktik, Rekayasa Perangkat Lunak, Rekayasa Data, Pengembangan Aplikasi Web, Komputasi Awan, Interaksi Manusia dan Komputer, Proyek Senior Teknologi Informasi, Proyek Perancangan Teknologi Informasi 1, Keamanan Komputer, Integrasi Aplikasi dan Informasi, Proyek Perancangan Teknologi Informasi 2',
|
| 72 |
+
'Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekainika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prak. Pemrograman Dasar, Algoritma dan Struktur Data, Analisis Variabel Kompleks, Aljabar Linear, Fisika Listrik dan Magnet, Persamaan Differensial, Probabilitas dan Variabel Acak, Metode Numeris, Statistika, Isyarat dan Sistem, Teknik Digital, Analisis Untai Elektrik DC, Telekomunikasi Dasar, Prakt. Sains Dasar, Medan Elektromagnetik, Elektronika Dasar, Teknik Kendali, Sistem Mikroprosesor, Teknik Pengolahan Isyarat Digital, Analisis Untai Elektrik AC, Pengukuran dan Instrumentasi, Teknik Optimisasi, Jaringan dan Komunikasi Data, Kerja Praktik, Elektronika Analog, Perancangan Sistem Kendali Modern, Mesin Listrik 1, Teknik Tenaga Listrik, Proyek Junior Teknik Elektro, Sistem Komunikasi, Mesin Listrik 2, Analisis Sistem Tenaga, Proyek Perancangan Teknik Elektro 1, Proyek Senior Teknik Elektro, Proyek Perancangan Teknik Elektro 2',
|
| 73 |
+
]
|
| 74 |
+
)
|
| 75 |
+
# [{'corpus_id': ..., 'score': ...}, {'corpus_id': ..., 'score': ...}, ...]
|
| 76 |
+
```
|
| 77 |
+
|
| 78 |
+
<!--
|
| 79 |
+
### Direct Usage (Transformers)
|
| 80 |
+
|
| 81 |
+
<details><summary>Click to see the direct usage in Transformers</summary>
|
| 82 |
+
|
| 83 |
+
</details>
|
| 84 |
+
-->
|
| 85 |
+
|
| 86 |
+
<!--
|
| 87 |
+
### Downstream Usage (Sentence Transformers)
|
| 88 |
+
|
| 89 |
+
You can finetune this model on your own dataset.
|
| 90 |
+
|
| 91 |
+
<details><summary>Click to expand</summary>
|
| 92 |
+
|
| 93 |
+
</details>
|
| 94 |
+
-->
|
| 95 |
+
|
| 96 |
+
<!--
|
| 97 |
+
### Out-of-Scope Use
|
| 98 |
+
|
| 99 |
+
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
|
| 100 |
+
-->
|
| 101 |
+
|
| 102 |
+
<!--
|
| 103 |
+
## Bias, Risks and Limitations
|
| 104 |
+
|
| 105 |
+
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
|
| 106 |
+
-->
|
| 107 |
+
|
| 108 |
+
<!--
|
| 109 |
+
### Recommendations
|
| 110 |
+
|
| 111 |
+
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
|
| 112 |
+
-->
|
| 113 |
+
|
| 114 |
+
## Training Details
|
| 115 |
+
|
| 116 |
+
### Training Dataset
|
| 117 |
+
|
| 118 |
+
#### Unnamed Dataset
|
| 119 |
+
|
| 120 |
+
* Size: 45 training samples
|
| 121 |
+
* Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>label</code>
|
| 122 |
+
* Approximate statistics based on the first 45 samples:
|
| 123 |
+
| | sentence_0 | sentence_1 | label |
|
| 124 |
+
|:--------|:----------------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------|:---------------------------------------------------------------|
|
| 125 |
+
| type | string | string | float |
|
| 126 |
+
| details | <ul><li>min: 818 characters</li><li>mean: 1846.82 characters</li><li>max: 3490 characters</li></ul> | <ul><li>min: 818 characters</li><li>mean: 1345.38 characters</li><li>max: 3490 characters</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.49</li><li>max: 1.0</li></ul> |
|
| 127 |
+
* Samples:
|
| 128 |
+
| sentence_0 | sentence_1 | label |
|
| 129 |
+
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------|
|
| 130 |
+
| <code>Kalkulus I, Fisika Dasar I, Kimia Dasar I, Pemrograman, Praktikum Pemrograman, Pengantar Logika Matematika dan Himpunan, Statistika, Kalkulus II, Geometri Analitik, Aljabar Linear Elementer, Matematika Diskrit, Pengantar Struktur Aljabar I, Kalkulus Multivariabel I, Pengantar Struktur Aljabar II, Persamaan Diferensial Elementer, Pengantar Statistika Matematika I, Pengantar Analisis Numerik, Kalkulus Lanjut, Program Linear, Kalkulus Multivariabel II, Fungsi Variabel Kompleks I, Pengantar Analisis I, Aljabar Linear, Matematika Komputasi, Pengantar Persamaan Diferensial Parsial, Fungsi Variabel Kompleks II, Pengantar Analisis II, Pengantar Model Matematika, Pengantar Proses Stokastik, Pengantar Teori Bilangan, Geometri Transformasi, Geometri, Geometri di Ruang Euclide berdimensi-n, Kalkulus Variasi, Pengantar Topologi, Pengantar Teori Ukuran & Integral Lebesgue, Kalkulus Stokastik, Pengantar Teori Persamaan Diferensial, Pengantar Ruang Riesz, Pengantar Geometri Diferensial, Pengantar Anal...</code> | <code>Metode Statistika, Praktikum Metode Statistika, Kalkulus 1, Fisika Dasar 1, Kimia Dasar 1, Pemrograman, Praktikum Pemrograman, Kalkulus II, Eksplorasi dan Visualisasi Data, Aljabar Matriks, Matematika Diskrit dan Kombinatorik, Bahasa Inggris, Basis Data, Analisis Regresi Terapan, Probabilitas dan Proses Stokastik, Bahasa Indonesia, Metode Survey Sampel, Kalkulus Multivariabel untuk Statistika, Persamaan Diferensial Elementer, Pengantar Rancangan Percobaan, Pengantar Statistika Matematik I, Komputasi Statistika I, Kalkulus Lanjut, Statistika Multivariat Terapan, Pengantar Statistika Matematik II, Statistika Ofisial, Pengantar Data Sains, Komputasi Statistika II, Pengantar Data Mining, Kerja Praktek, Pengantar Teori Ukuran dan Probabilitas, Kewarganegaraan, Pengantar Runtun Waktu, Tugas Akhir I, Tugas Akhir II, Pengantar Teori Antrian & Simulasi, Manajemen Risiko Kuantitatif, Biostatistika dan Epidemiologi, Pengantar Response Surface, Kapita Selekta Statistik A, Pengantar Teori Keputusan...</code> | <code>1.0</code> |
|
| 131 |
+
| <code>Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekainika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prak. Pemrograman Dasar, Algoritma dan Struktur Data, Analisis Variabel Kompleks, Aljabar Linear, Fisika Listrik dan Magnet, Persamaan Differensial, Probabilitas dan Variabel Acak, Metode Numeris, Statistika, Isyarat dan Sistem, Teknik Digital, Analisis Untai Elektrik DC, Telekomunikasi Dasar, Prakt. Sains Dasar, Medan Elektromagnetik, Elektronika Dasar, Teknik Kendali, Sistem Mikroprosesor, Teknik Pengolahan Isyarat Digital, Analisis Untai Elektrik AC, Pengukuran dan Instrumentasi, Teknik Optimisasi, Jaringan dan Komunikasi Data, Kerja Praktik, Elektronika Analog, Perancangan Sistem Kendali Modern, Mesin Listrik 1, Teknik Tenaga Listrik, Proyek Junior Teknik Elektro, Sistem Komunikasi, Mesin Listrik 2, Analisis Sistem Tenaga, Proyek Perancangan Teknik Elektro 1, Proyek Senior Teknik Elektro, Proyek Perancanga...</code> | <code>Matematika Dasar, Aljabar Linier, Kalkulus 1, Pengenalan Teknologi Elektro dan Informatika Cerdas, Dasar Pemrograman, Matematika Diskrit, Sistem Operasi, Basisdata, Kalkulus 2, Probabilitas dan Statistik, Konsep Kecerdasan Artifisial, Struktur Data, Jaringan Komputer, Teori Graf, Keamanan Informasi, Perancangan dan Analisis Algoritma, Pemrograman Berorientasi Objek, Pembelajaran Mesin, Kecerdasan Komputasional, Penambangan Data, Pemrosesan Bahasa Natural, Dasar Pengembangan dan Perancangan Perangkat Lunak, Grafika Komputer, Interaksi Manusia dan Komputer, Pemrograman Web, Deep Learning, Pengolahan Citra Digital, Temu Kembali Informasi, Pemodelan 3D, Visi Komputer, Gim Cerdas, Realitas X, Visualisasi Informasi, Komputasi Biomedik, Kecerdasan Bisnis, Robotika, Visi Komputer 3D, Perancangan dan Pengembangan Gim, Pemrograman XR</code> | <code>0.0</code> |
|
| 132 |
+
| <code>Pengawetan dan Aplikasi Produk Hayati (Natural Product Preservation and Application), Aplikasi Bioproses (Applied Bioprocess), Pemisahan dan Pemurnian Produk Hayati (Natural Product Separation and Purification), Proses Industri Pangan (Industrial Food Processing), Kapita Selekta di Teknik Produk Hayati dan Bioproses (Selected Topics in Natural Products Engineering and Bioprocess), Energi Tidak Terbarukan (Non-renewable Energy), Manajemen dan Konservasi Energi (Energy Management and Conservation), Energi Terbarukan (Renewable Energy), Utilisasi Energi (Energy Utilization), Kapita Selekta di Energi (Selected Topics in Energy), Produksi Bersih (Cleaner Production), Teknik Pengolahan Limbah (Waste Treatment Technology), Analisis Risiko Industri (Industrial Risk Analysis), Manajemen Keselamatan Proses dan Lingkungan (Process Safety and Environmental Management), Kapita Selekta di Keselamatan, Kesehatan dan Lingkungan (Selected Topics in Safety, Health and Environment), Pemrograman Komputer ...</code> | <code>Pemrograman Dasar, Matematika Diskrit, Teori Vektor dan Matriks, Kalkulus Variabel Tunggal, Fisika Mekainika Klasik, Kalkulus Variabel Jamak, Fisika Fluida, Kalor, dan Gelombang, Prak. Pemrograman Dasar, Algoritma dan Struktur Data, Analisis Variabel Kompleks, Aljabar Linear, Fisika Listrik dan Magnet, Persamaan Differensial, Probabilitas dan Variabel Acak, Metode Numeris, Statistika, Isyarat dan Sistem, Teknik Digital, Analisis Untai Elektrik DC, Telekomunikasi Dasar, Prakt. Sains Dasar, Medan Elektromagnetik, Elektronika Dasar, Teknik Kendali, Sistem Mikroprosesor, Teknik Pengolahan Isyarat Digital, Analisis Untai Elektrik AC, Pengukuran dan Instrumentasi, Teknik Optimisasi, Jaringan dan Komunikasi Data, Kerja Praktik, Elektronika Analog, Perancangan Sistem Kendali Modern, Mesin Listrik 1, Teknik Tenaga Listrik, Proyek Junior Teknik Elektro, Sistem Komunikasi, Mesin Listrik 2, Analisis Sistem Tenaga, Proyek Perancangan Teknik Elektro 1, Proyek Senior Teknik Elektro, Proyek Perancanga...</code> | <code>0.0</code> |
|
| 133 |
+
* Loss: [<code>BinaryCrossEntropyLoss</code>](https://sbert.net/docs/package_reference/cross_encoder/losses.html#binarycrossentropyloss) with these parameters:
|
| 134 |
+
```json
|
| 135 |
+
{
|
| 136 |
+
"activation_fn": "torch.nn.modules.linear.Identity",
|
| 137 |
+
"pos_weight": null
|
| 138 |
+
}
|
| 139 |
+
```
|
| 140 |
+
|
| 141 |
+
### Training Hyperparameters
|
| 142 |
+
#### Non-Default Hyperparameters
|
| 143 |
+
|
| 144 |
+
- `num_train_epochs`: 10
|
| 145 |
+
|
| 146 |
+
#### All Hyperparameters
|
| 147 |
+
<details><summary>Click to expand</summary>
|
| 148 |
+
|
| 149 |
+
- `overwrite_output_dir`: False
|
| 150 |
+
- `do_predict`: False
|
| 151 |
+
- `eval_strategy`: no
|
| 152 |
+
- `prediction_loss_only`: True
|
| 153 |
+
- `per_device_train_batch_size`: 8
|
| 154 |
+
- `per_device_eval_batch_size`: 8
|
| 155 |
+
- `per_gpu_train_batch_size`: None
|
| 156 |
+
- `per_gpu_eval_batch_size`: None
|
| 157 |
+
- `gradient_accumulation_steps`: 1
|
| 158 |
+
- `eval_accumulation_steps`: None
|
| 159 |
+
- `torch_empty_cache_steps`: None
|
| 160 |
+
- `learning_rate`: 5e-05
|
| 161 |
+
- `weight_decay`: 0.0
|
| 162 |
+
- `adam_beta1`: 0.9
|
| 163 |
+
- `adam_beta2`: 0.999
|
| 164 |
+
- `adam_epsilon`: 1e-08
|
| 165 |
+
- `max_grad_norm`: 1
|
| 166 |
+
- `num_train_epochs`: 10
|
| 167 |
+
- `max_steps`: -1
|
| 168 |
+
- `lr_scheduler_type`: linear
|
| 169 |
+
- `lr_scheduler_kwargs`: {}
|
| 170 |
+
- `warmup_ratio`: 0.0
|
| 171 |
+
- `warmup_steps`: 0
|
| 172 |
+
- `log_level`: passive
|
| 173 |
+
- `log_level_replica`: warning
|
| 174 |
+
- `log_on_each_node`: True
|
| 175 |
+
- `logging_nan_inf_filter`: True
|
| 176 |
+
- `save_safetensors`: True
|
| 177 |
+
- `save_on_each_node`: False
|
| 178 |
+
- `save_only_model`: False
|
| 179 |
+
- `restore_callback_states_from_checkpoint`: False
|
| 180 |
+
- `no_cuda`: False
|
| 181 |
+
- `use_cpu`: False
|
| 182 |
+
- `use_mps_device`: False
|
| 183 |
+
- `seed`: 42
|
| 184 |
+
- `data_seed`: None
|
| 185 |
+
- `jit_mode_eval`: False
|
| 186 |
+
- `bf16`: False
|
| 187 |
+
- `fp16`: False
|
| 188 |
+
- `fp16_opt_level`: O1
|
| 189 |
+
- `half_precision_backend`: auto
|
| 190 |
+
- `bf16_full_eval`: False
|
| 191 |
+
- `fp16_full_eval`: False
|
| 192 |
+
- `tf32`: None
|
| 193 |
+
- `local_rank`: 0
|
| 194 |
+
- `ddp_backend`: None
|
| 195 |
+
- `tpu_num_cores`: None
|
| 196 |
+
- `tpu_metrics_debug`: False
|
| 197 |
+
- `debug`: []
|
| 198 |
+
- `dataloader_drop_last`: False
|
| 199 |
+
- `dataloader_num_workers`: 0
|
| 200 |
+
- `dataloader_prefetch_factor`: None
|
| 201 |
+
- `past_index`: -1
|
| 202 |
+
- `disable_tqdm`: False
|
| 203 |
+
- `remove_unused_columns`: True
|
| 204 |
+
- `label_names`: None
|
| 205 |
+
- `load_best_model_at_end`: False
|
| 206 |
+
- `ignore_data_skip`: False
|
| 207 |
+
- `fsdp`: []
|
| 208 |
+
- `fsdp_min_num_params`: 0
|
| 209 |
+
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
|
| 210 |
+
- `fsdp_transformer_layer_cls_to_wrap`: None
|
| 211 |
+
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
|
| 212 |
+
- `parallelism_config`: None
|
| 213 |
+
- `deepspeed`: None
|
| 214 |
+
- `label_smoothing_factor`: 0.0
|
| 215 |
+
- `optim`: adamw_torch_fused
|
| 216 |
+
- `optim_args`: None
|
| 217 |
+
- `adafactor`: False
|
| 218 |
+
- `group_by_length`: False
|
| 219 |
+
- `length_column_name`: length
|
| 220 |
+
- `project`: huggingface
|
| 221 |
+
- `trackio_space_id`: trackio
|
| 222 |
+
- `ddp_find_unused_parameters`: None
|
| 223 |
+
- `ddp_bucket_cap_mb`: None
|
| 224 |
+
- `ddp_broadcast_buffers`: False
|
| 225 |
+
- `dataloader_pin_memory`: True
|
| 226 |
+
- `dataloader_persistent_workers`: False
|
| 227 |
+
- `skip_memory_metrics`: True
|
| 228 |
+
- `use_legacy_prediction_loop`: False
|
| 229 |
+
- `push_to_hub`: False
|
| 230 |
+
- `resume_from_checkpoint`: None
|
| 231 |
+
- `hub_model_id`: None
|
| 232 |
+
- `hub_strategy`: every_save
|
| 233 |
+
- `hub_private_repo`: None
|
| 234 |
+
- `hub_always_push`: False
|
| 235 |
+
- `hub_revision`: None
|
| 236 |
+
- `gradient_checkpointing`: False
|
| 237 |
+
- `gradient_checkpointing_kwargs`: None
|
| 238 |
+
- `include_inputs_for_metrics`: False
|
| 239 |
+
- `include_for_metrics`: []
|
| 240 |
+
- `eval_do_concat_batches`: True
|
| 241 |
+
- `fp16_backend`: auto
|
| 242 |
+
- `push_to_hub_model_id`: None
|
| 243 |
+
- `push_to_hub_organization`: None
|
| 244 |
+
- `mp_parameters`:
|
| 245 |
+
- `auto_find_batch_size`: False
|
| 246 |
+
- `full_determinism`: False
|
| 247 |
+
- `torchdynamo`: None
|
| 248 |
+
- `ray_scope`: last
|
| 249 |
+
- `ddp_timeout`: 1800
|
| 250 |
+
- `torch_compile`: False
|
| 251 |
+
- `torch_compile_backend`: None
|
| 252 |
+
- `torch_compile_mode`: None
|
| 253 |
+
- `include_tokens_per_second`: False
|
| 254 |
+
- `include_num_input_tokens_seen`: no
|
| 255 |
+
- `neftune_noise_alpha`: None
|
| 256 |
+
- `optim_target_modules`: None
|
| 257 |
+
- `batch_eval_metrics`: False
|
| 258 |
+
- `eval_on_start`: False
|
| 259 |
+
- `use_liger_kernel`: False
|
| 260 |
+
- `liger_kernel_config`: None
|
| 261 |
+
- `eval_use_gather_object`: False
|
| 262 |
+
- `average_tokens_across_devices`: True
|
| 263 |
+
- `prompts`: None
|
| 264 |
+
- `batch_sampler`: batch_sampler
|
| 265 |
+
- `multi_dataset_batch_sampler`: proportional
|
| 266 |
+
- `router_mapping`: {}
|
| 267 |
+
- `learning_rate_mapping`: {}
|
| 268 |
+
|
| 269 |
+
</details>
|
| 270 |
+
|
| 271 |
+
### Framework Versions
|
| 272 |
+
- Python: 3.12.12
|
| 273 |
+
- Sentence Transformers: 5.1.2
|
| 274 |
+
- Transformers: 4.57.1
|
| 275 |
+
- PyTorch: 2.8.0+cu126
|
| 276 |
+
- Accelerate: 1.11.0
|
| 277 |
+
- Datasets: 4.0.0
|
| 278 |
+
- Tokenizers: 0.22.1
|
| 279 |
+
|
| 280 |
+
## Citation
|
| 281 |
+
|
| 282 |
+
### BibTeX
|
| 283 |
+
|
| 284 |
+
#### Sentence Transformers
|
| 285 |
+
```bibtex
|
| 286 |
+
@inproceedings{reimers-2019-sentence-bert,
|
| 287 |
+
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
| 288 |
+
author = "Reimers, Nils and Gurevych, Iryna",
|
| 289 |
+
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
| 290 |
+
month = "11",
|
| 291 |
+
year = "2019",
|
| 292 |
+
publisher = "Association for Computational Linguistics",
|
| 293 |
+
url = "https://arxiv.org/abs/1908.10084",
|
| 294 |
+
}
|
| 295 |
+
```
|
| 296 |
+
|
| 297 |
+
<!--
|
| 298 |
+
## Glossary
|
| 299 |
+
|
| 300 |
+
*Clearly define terms in order to be accessible across audiences.*
|
| 301 |
+
-->
|
| 302 |
+
|
| 303 |
+
<!--
|
| 304 |
+
## Model Card Authors
|
| 305 |
+
|
| 306 |
+
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
|
| 307 |
+
-->
|
| 308 |
+
|
| 309 |
+
<!--
|
| 310 |
+
## Model Card Contact
|
| 311 |
+
|
| 312 |
+
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
|
| 313 |
+
-->
|
config.json
ADDED
|
@@ -0,0 +1,35 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"architectures": [
|
| 3 |
+
"BertForSequenceClassification"
|
| 4 |
+
],
|
| 5 |
+
"attention_probs_dropout_prob": 0.1,
|
| 6 |
+
"classifier_dropout": null,
|
| 7 |
+
"dtype": "float32",
|
| 8 |
+
"gradient_checkpointing": false,
|
| 9 |
+
"hidden_act": "gelu",
|
| 10 |
+
"hidden_dropout_prob": 0.1,
|
| 11 |
+
"hidden_size": 384,
|
| 12 |
+
"id2label": {
|
| 13 |
+
"0": "LABEL_0"
|
| 14 |
+
},
|
| 15 |
+
"initializer_range": 0.02,
|
| 16 |
+
"intermediate_size": 1536,
|
| 17 |
+
"label2id": {
|
| 18 |
+
"LABEL_0": 0
|
| 19 |
+
},
|
| 20 |
+
"layer_norm_eps": 1e-12,
|
| 21 |
+
"max_position_embeddings": 512,
|
| 22 |
+
"model_type": "bert",
|
| 23 |
+
"num_attention_heads": 12,
|
| 24 |
+
"num_hidden_layers": 6,
|
| 25 |
+
"pad_token_id": 0,
|
| 26 |
+
"position_embedding_type": "absolute",
|
| 27 |
+
"sentence_transformers": {
|
| 28 |
+
"activation_fn": "torch.nn.modules.linear.Identity",
|
| 29 |
+
"version": "5.1.2"
|
| 30 |
+
},
|
| 31 |
+
"transformers_version": "4.57.1",
|
| 32 |
+
"type_vocab_size": 2,
|
| 33 |
+
"use_cache": true,
|
| 34 |
+
"vocab_size": 30522
|
| 35 |
+
}
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:6383b5f432cac09f0e41b4e8e3d53f249a3d047f460eb1d16151edf464e45009
|
| 3 |
+
size 90866412
|
special_tokens_map.json
ADDED
|
@@ -0,0 +1,37 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"cls_token": {
|
| 3 |
+
"content": "[CLS]",
|
| 4 |
+
"lstrip": false,
|
| 5 |
+
"normalized": false,
|
| 6 |
+
"rstrip": false,
|
| 7 |
+
"single_word": false
|
| 8 |
+
},
|
| 9 |
+
"mask_token": {
|
| 10 |
+
"content": "[MASK]",
|
| 11 |
+
"lstrip": false,
|
| 12 |
+
"normalized": false,
|
| 13 |
+
"rstrip": false,
|
| 14 |
+
"single_word": false
|
| 15 |
+
},
|
| 16 |
+
"pad_token": {
|
| 17 |
+
"content": "[PAD]",
|
| 18 |
+
"lstrip": false,
|
| 19 |
+
"normalized": false,
|
| 20 |
+
"rstrip": false,
|
| 21 |
+
"single_word": false
|
| 22 |
+
},
|
| 23 |
+
"sep_token": {
|
| 24 |
+
"content": "[SEP]",
|
| 25 |
+
"lstrip": false,
|
| 26 |
+
"normalized": false,
|
| 27 |
+
"rstrip": false,
|
| 28 |
+
"single_word": false
|
| 29 |
+
},
|
| 30 |
+
"unk_token": {
|
| 31 |
+
"content": "[UNK]",
|
| 32 |
+
"lstrip": false,
|
| 33 |
+
"normalized": false,
|
| 34 |
+
"rstrip": false,
|
| 35 |
+
"single_word": false
|
| 36 |
+
}
|
| 37 |
+
}
|
tokenizer.json
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,58 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"added_tokens_decoder": {
|
| 3 |
+
"0": {
|
| 4 |
+
"content": "[PAD]",
|
| 5 |
+
"lstrip": false,
|
| 6 |
+
"normalized": false,
|
| 7 |
+
"rstrip": false,
|
| 8 |
+
"single_word": false,
|
| 9 |
+
"special": true
|
| 10 |
+
},
|
| 11 |
+
"100": {
|
| 12 |
+
"content": "[UNK]",
|
| 13 |
+
"lstrip": false,
|
| 14 |
+
"normalized": false,
|
| 15 |
+
"rstrip": false,
|
| 16 |
+
"single_word": false,
|
| 17 |
+
"special": true
|
| 18 |
+
},
|
| 19 |
+
"101": {
|
| 20 |
+
"content": "[CLS]",
|
| 21 |
+
"lstrip": false,
|
| 22 |
+
"normalized": false,
|
| 23 |
+
"rstrip": false,
|
| 24 |
+
"single_word": false,
|
| 25 |
+
"special": true
|
| 26 |
+
},
|
| 27 |
+
"102": {
|
| 28 |
+
"content": "[SEP]",
|
| 29 |
+
"lstrip": false,
|
| 30 |
+
"normalized": false,
|
| 31 |
+
"rstrip": false,
|
| 32 |
+
"single_word": false,
|
| 33 |
+
"special": true
|
| 34 |
+
},
|
| 35 |
+
"103": {
|
| 36 |
+
"content": "[MASK]",
|
| 37 |
+
"lstrip": false,
|
| 38 |
+
"normalized": false,
|
| 39 |
+
"rstrip": false,
|
| 40 |
+
"single_word": false,
|
| 41 |
+
"special": true
|
| 42 |
+
}
|
| 43 |
+
},
|
| 44 |
+
"clean_up_tokenization_spaces": true,
|
| 45 |
+
"cls_token": "[CLS]",
|
| 46 |
+
"do_basic_tokenize": true,
|
| 47 |
+
"do_lower_case": true,
|
| 48 |
+
"extra_special_tokens": {},
|
| 49 |
+
"mask_token": "[MASK]",
|
| 50 |
+
"model_max_length": 512,
|
| 51 |
+
"never_split": null,
|
| 52 |
+
"pad_token": "[PAD]",
|
| 53 |
+
"sep_token": "[SEP]",
|
| 54 |
+
"strip_accents": null,
|
| 55 |
+
"tokenize_chinese_chars": true,
|
| 56 |
+
"tokenizer_class": "BertTokenizer",
|
| 57 |
+
"unk_token": "[UNK]"
|
| 58 |
+
}
|
vocab.txt
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|