---
base_model: BAAI/bge-small-en-v1.5
datasets: []
language: []
library_name: sentence-transformers
metrics:
- cosine_accuracy@1
- cosine_accuracy@5
- cosine_accuracy@10
- cosine_precision@1
- cosine_precision@5
- cosine_precision@10
- cosine_recall@1
- cosine_recall@5
- cosine_recall@10
- cosine_ndcg@5
- cosine_ndcg@10
- cosine_ndcg@100
- cosine_mrr@5
- cosine_mrr@10
- cosine_mrr@100
- cosine_map@100
- dot_accuracy@1
- dot_accuracy@5
- dot_accuracy@10
- dot_precision@1
- dot_precision@5
- dot_precision@10
- dot_recall@1
- dot_recall@5
- dot_recall@10
- dot_ndcg@5
- dot_ndcg@10
- dot_ndcg@100
- dot_mrr@5
- dot_mrr@10
- dot_mrr@100
- dot_map@100
pipeline_tag: sentence-similarity
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:7033
- loss:GISTEmbedLoss
widget:
- source_sentence: Are Producer Companies required to maintain a general reserve?
sentences:
- '''DONATIONS OR SUBSCRIPTION BY PRODUCER COMPANY A Producer Company may, by special
resolution, make donation or subscription to any institution or individual for
the purposes of - (a) promoting the social and economic welfare of Producer Members
or producers or general public; or (b) *promoting the mutual assistance principles:*
Provided that the aggregate amount of all such donation and subscription in any
financial year shall not exceed three per cent of the net profit of the Producer
Company in the financial year immediately preceding the financial year in which
the donation or subscription was made: Provided further that no Producer Company
shall make directly or indirectly to any political party or for any political
purpose to any person any contribution or subscription or make available any facilities
including personnel or material. 581ZI. GENERAL AND OTHER RESERVES (1) Every
Producer Company shall maintain a general reserve in every financial year, in
addition to any reserve maintained by it as may be specified in articles. (2)
In a case where the Producer Company does not have sufficient funds in any financial
year for transfer to maintain the reserves as may be specified in articles, the
contribution to the reserve shall be shared amongst the Members in proportion
to their patronage in the business of that company in that year. 581ZJ. ISSUE
OF BONUS SHARES Any Producer Company may, upon recommendation of the Board and
passing of resolution in the general meeting, issue bonus shares by capitalisation
of amounts from general reserves referred to in section 581ZI in proportion to
the shares held by the Members on the date of the issue of such shares.'''
- ''' POPI will be required to submit a Utilization Certificate as per Annexure
II, in respect of funds released earlier, for processing of release proposal
from second instalment onwards. xi. POPI will maintain detailed account of
expenditure of all approved items in respect of each FPO separately and retain
all original vouchers and receipts for verification by NABARD and RSA. xii. POPI
shall submit monthly progress report to NABARD Regional Office before 5th of the succeeding
month as per **Annexure III** xiii. POPI shall constitute a \''Project Monitoring
Committee (PMC) consisting of representative of POPI, RSA, DDM **of** NABARD,
Lead District Manager, ATMA, Agriculture department and a Board member of FPO(to
be promoted). The PMC shall meet quarterly to review the progress, guide the project
execution and make recommendation for release of grant to POPI/FPO. xiv. POPI
will submit all such information and data as required for the periodic monitoring
of the project by NABARD/its representatives. POPI shall not publish the reports/research
findings/results without a written permission from NABARD. Further, NABARD shall
have the right to use the same for its internal use, training, publicity, etc.,
after duly acknowledging the source(s). xv. POPI may undertake to document
its experience during the course of implementation of the project and submit
to NABARD Regional Office for information/record. xvi. The assistance of
NABARD shall be duly acknowledged by displaying suitable sign board containing
**\''Project supported under NABARD assistance\''** at the FPO Office and also
while organising training programmes and printing of publicity/documentation material
in respect of the project. xvii. POPI shall not sub-contract the work assigned
to it to any other institution/entity. xviii. In the event of POPI availing
assistance from any other agency for any activity of the same project, NABARD''s
assistance will be reduced to that extent.'''
- '''6.3.1 Aadhaar has been made mandatory for availing Crop insurance from Kharif
2017 season onwards. Therefore, all banks are advised to mandatorily obtain
Aadhaar number of their farmers and the same applies for non-loanee farmers enrolled through banks/Insurance companies/insurance intermediaries. 6.3.2 Farmers
not having Aadhaar ID may also enrol under PMFBY subject to their enrolment for Aadhaar
and submission of proof of such enrolment as per notification No. 334.dated 8th
February, 2017 issued by GOI under Section 7 of Aadhaar Act 2016(Targeted Delivery
of Financial and other Subsidies, Benefits and Services). Copy of the notification
may be perused on www.pmfby.gov.in. This may be subject to further directions
issued by Govt. from time to time. 6.3.3 All banks have to compulsorily take
Aadhaar/Aadhaar enrolment number as per notification under Aadhaar Act before
sanction of crop loan/KCC under Interest Subvention Scheme. Hence the coverage of
loanee farmers without Aadhaar does not arise and such accounts need to be reviewed
by the concerned bank branch regularly.'''
- source_sentence: Where is the shoot borer widely distributed?
sentences:
- '''Sugarcane is an important commercial crop in India . It is cultivated under
diverse agro-climatic conditions . The crop is damaged by 5 important moth borers
. Among these borers the shoot borer, Chilo infuscatellus is an important one
and is widely distributed in all cane growing areas in India. The infestation
reduces cane production, Parthasarathy et al (1953) observed a loss in weight
of the infested clumps varying from 15.8 to 41.7 % A decrease in yield by 10 t
/ha has been calculated by Ramachandrachari (1959) Avasthy (1968) correlated the
incidence of shoot borer with cane yield and found 3.5 % loss in yield at every
5 % increase in borer infestation. High temperature , low humidity and scanty
rainfall and poor irrigation facilitate high incidence of shoot borer.'''
- '''3. Whitefly , Bemisia tabaci , Aleyrodidae, Hemiptera Symptom of damage: Yellowing
of leaves, plant vitality reduced, development of sooty mould, plant dies in case
of severe attack. Nature of damage: Nymphs and adults suck the plant sap and
also transmits yellow mosaic virus (YMV). Egg: Stalked, sub-elliptical, light
yellow at first, and turning brown later on. Eggs laid singly on adaxial (lower)
side of leaves. Nymph: Elliptical on emergence, soon they fix their mouthparts
into the plant tissues and feed on the cell sap. Greenish yellow, oval on undersurface
of leaves. Adult: Small with yellow body covered with white waxy bloom.'''
- '''Kharif / Kharif Kharif / Rabi food grains To get a higher yield of wheat, it
is necessary to pay attention to the following points: -. For field preparation,
plough first with a cultivator and then use a rotavator ''harrow''. Organic fertilizers
must be used. As much as possible, half of the nutrients should be provided by
organic fertilizers. The species should be selected according to regional compatibility
and seasonality. Pure and certified seeds should be sown after seed treatment.
Balanced amounts of fertilizers should be used at the right time and in the right
manner based on soil testing. Irrigation at critical stages (crown root stage
and flowering stage) should be done in a timely manner and in adequate quantity.
Outbreaks of wheatgrass (Phalaris minor) and wild oats should be controlled in
time. & 4S HA # (4? (A) Other activities should be completed on time based on
the recommendation |0. Seeds must be replaced after the third year.. Gerotillage
and raised bed method should be used. 2. Special care should be taken to prevent
pests and diseases. Intensive methods: In case of irrigated sowing: About 97%
of the total wheat area in the state is irrigated but assured or assured irrigation
is available in a small area. Hence, the sowing of wheat is often delayed. We
have to decide in advance which variety of paddy to choose in kharif and which
variety of wheat to sow in rabi. To get a good yield of wheat, it is necessary
to sow paddy in time, so that the field is empty for wheat in October. Another
thing to be noted is that puddling or leva in paddy causes the soil to harden.
In heavy soils, it is advisable to sow wheat by first ploughing with a soil-reversing
plough and then ploughing the soil twice with a disc harrow. Paddy stalks are
cut into small pieces using disc harrows. To decompose them quickly, 45-20 kg.
Nitrogen (as urea) per se. When preparing the field, it must be given at the first
ploughing. The field is fully prepared in a single ploughing by a tractor-driven
rotavator. |बुवाई: Wheat must be sown on time and at sufficient moisture. Late-maturing
varieties must be sown on time, otherwise the yield decreases. As sowing is delayed,
the rate of decline in wheat yields increases. Wheat yields increase from 3 to
4 kg / ha when sown from December onwards. And 4 to 5 k.g. / ha when sown in January.
The rate per week decreases. Sowing wheat with a seed drill can save fertilizer
and seed. 4'''
- source_sentence: Why is the development of Best Practices, Pilot Projects, and Success
Stories important for FPOs?
sentences:
- '''III. SAP FEEDERS 8. Shoot bug : Peregrinus maidis : Delphacidae: Hemiptera
Symptom of attack: The leaves turn yellow due to sucking; plants become weak and
the yield goes down. The mid rib of the leaves become red due to egg laying and
may dry up subsequently. Nature of damage: Both adults and nymphs suck the plant
sap from the leaves and cause the shoot to dry. They feed gregariously within
the leaf sheaths. It is not a serious pest, but sometimes causes appreciable damage.
Life stages: It is a small active, grayish brown bug. Colonies of this bug (both
adults and nymphs) live within the whorl of the central leaf or in the root region.
This pest is very common in Coimbatore during summer. The large black ant attends
these insects.'''
- '''a. Through a survey; or b. Through Focused Group Discussion Determine key
indicators for the monitoring process- Develop formats Secondary Data - The returns
submitted by the PO, data available from the Government Departments and also published
data from other projects. 10.16 What are the methods of sampling? There are 3
sampling techniques: random sampling, stratified sampling and cluster sampling
Random sampling: Sampling of households on random basis Stratified sampling: The
producers are categorized into different strata like big, medium and small. Data
are collected from each strata in a specified proportion i.e., say, every fifth
producer''s household data from the big producers, every third producer''s house
hold data from small producers every second house hold data from the very small
producers'' category Cluster sampling: In this case, data of only those producers
households will be collected who are in the cluster for a specified period 10.17
How to analyze the data? Analysis is the process of turning the detailed data
into an understanding of patterns, trends and interpretations. The step by step
process involved in monitoring analysis is enumerated below:'''
- '''i. Identification of potential FPOs among successful Watershed Development
projects, Wadi Projects and their Federations. ii. Identification of natural
clusters of farmers groups to facilitate formation of FPOs iii. Close involvement
of stakeholders such as NGOs, Banks, Govt. line departments, commodity Boards,
Corporations, Corporate, functional Universities, cooperatives, Federations, Trade
bodies, etc. for identification, promotion, nurturing, development, capacity building,
evaluation etc. of FPOs iv. Development of Best Practices, Pilot Projects and
Success Stories for wider publicity and field level replication v. Adoption
of mission mode with periodic qualitative and quantitative milestones with timelines vi. Wide
publicity to the FPO Scheme through print, electronic media and adopting other
Mass Communication Strategies vii. Conventional/non-conventional publicity and
awareness creation methods viii. Launching of pilot projects, action research
projects, experimental projects, field trials etc. to learn and understand various
models of FPOs and successful strategies for wider replication'''
- source_sentence: Apart from nutrients and protein, what role does Moong have in
pulses crops?
sentences:
- '''DONATIONS OR SUBSCRIPTION BY PRODUCER COMPANY A Producer Company may, by special
resolution, make donation or subscription to any institution or individual for
the purposes of - (a) promoting the social and economic welfare of Producer Members
or producers or general public; or (b) *promoting the mutual assistance principles:*
Provided that the aggregate amount of all such donation and subscription in any
financial year shall not exceed three per cent of the net profit of the Producer
Company in the financial year immediately preceding the financial year in which
the donation or subscription was made: Provided further that no Producer Company
shall make directly or indirectly to any political party or for any political
purpose to any person any contribution or subscription or make available any facilities
including personnel or material. 581ZI. GENERAL AND OTHER RESERVES (1) Every
Producer Company shall maintain a general reserve in every financial year, in
addition to any reserve maintained by it as may be specified in articles. (2)
In a case where the Producer Company does not have sufficient funds in any financial
year for transfer to maintain the reserves as may be specified in articles, the
contribution to the reserve shall be shared amongst the Members in proportion
to their patronage in the business of that company in that year. 581ZJ. ISSUE
OF BONUS SHARES Any Producer Company may, upon recommendation of the Board and
passing of resolution in the general meeting, issue bonus shares by capitalisation
of amounts from general reserves referred to in section 581ZI in proportion to
the shares held by the Members on the date of the issue of such shares.'''
- '''Larval rearing : It is to be done in GI round basins (28 cm dia ) at 250 larvae
/basin covered with khada cloth . The eggs of Corcyra cephlonica are given as
feeding material for the larvae in the laboratory. For rearing 500 Chrysoperla
larvae the total quantity of Corcyra eggs required is 25 CC at the rate of 5.0
CC / feeding for 5 feedings in alternate days. The Chrysoperla larvae pupated
into round white coloured silken cocoon in 10 days. The cocoons are collected
with fine brush and transferred into a one litre plastic containers with wire
mesh window for emergence of adults. From the cocoons, pale green colored adults
with transparent lace like wings emerge in 9-10 days.'''
- '''Advanced cultivation of Kharif / Kharif Rabi / Rabi pulses is the major crop
of Moong Zaid. Moong has a multifaceted role in pulses crops. Apart from providing
nutrients and protein, it also replenishes green manure by replanting crops after
plucking the pods. Etawaligarh, Deoria, Etawah, Farrukhabad, Mathura, Lalitpur,
Kanpur Dehat, Hardoi, and Ghazipur districts of the state have emerged as major
groundnut producing districts. Other districts also have potential. Good yield
can be obtained in Zaid by considering the following factors - Field preparation:
Loam land is suitable for mung bean cultivation. Ploughing two tillers makes the
field ready. If there is a shortage of seeds, they should be replanted and sown.
Farm preparation can be done quickly with tractors, power tillers, rotovators,
or other modern agricultural machinery. Recommended varieties: The following varieties
with short maturation are suitable for good yield: - Species Notification Speciality
Ripe Produce Kuntala Pest Disease Preference Suitable Area Year Period (days)
Per Hectare Utilization 2 3 4 5 6 74. Narendra Moong - 992. Dana Dhumil. . 65-70
4 - 3 yellow mosaic whole U.P. 2. Malviya 2000 green grain. 65-70 2 - 5 Tolerant,
Tadeva Sampoorna U.P. Jagrati (H.P. UM-2) 3. Emperor 2004 Green Shining. 60-65
9 - 0 Yellow Magic Whole U.P. PDM-39) Avrodhi4. Malviya Janapriya. .200] - 60-65
2 - 5 Tadaiv Sampoorn Uttar Pradesh (HUM-6) 5. Azad Moong -] 2020 Bright green.
62-65 0 - 2 MYMV, Whole U.P. (K, M-2342) Colour Medium CLS, Ansharqunose, Bold
Grain Leaf Crinkle and Web Blight Resistant and Height Fly, Jasid and Shrips Resistant
|6. IPM 32-20 2020 Green and Medium 65-85 6 MYMV, Whole U.P. Large Grain Powdery
Mildew, Resistant to Sarcosporalife Spots and Resistant to Whitefly and Shrips.
< 84 >'''
- source_sentence: What is the purpose of heating the fresh fruit tissue in alcohol
or HCl?
sentences:
- '''a. Business Processes: Aggregation, segregation and logistics b. Productivity:
Man, material, money, input and output c. Warehousing: Space, costs and logistics
d. Processing : Own vs. out-source e. Products: Whole foods to processed
foods and to derivatives f. Risk mitigation 7.4 What is a business plan? Business
plan is a succinct document that specifies the components of a strategy with regard
to the business mission, external and internal environments and problems identified
in earlier analysis. A business plan is not written each time a modification to
a strategy is made. It should be written when a new venture is developed or a
major new initiative is launched. Sincere contemplation is needed about the business
concept, the business opportunity, the competitive landscape, the essential elements
for success, and the people who will be involved. The exercise will often lead
to more questions, and these new questions must be properly researched to gain
deep insight into the issues and challenges that lie ahead. In short, the business
plan must contain answers to the questions \''Who/What/Where/When/Why/How/How
Much\''. 7.5 What is business planning? The business planning process starts with
Generation of Business Ideas, followed by Opportunities & Threats Analysis leading
to Identification of suitable Business Opportunities. Once Business Opportunity
is identified, a Marketing Plan is prepared. The final part of the process deals
with the Financial Plan.'''
- '''The fresh fruit tissue or separated parts, including the peel and core are
heated in 95% alcohol or 0.05N HCl (pH 2.0) for 10-20 min at 70 o C to inactivate
pectic enzymes. After the pretreatment, the materials is ground in an electric
blender and placed in water. Versene or Na-EDTA is added at 2.0%. The pH is adjusted
to 6.0. The mixture is heated for about an hour at 90-95 o C. The slurry formed
is rapidly filtered and the pectin is precipitated from the solution using acidified
alcohol. The precipitate is centrifuged and repeatedly washed with 70% alcohol.
Acetone is used for dehydration and the pectin produced is vacuum-dried. It may
also be dried in a hot-air oven at 50 o C for 4 h.'''
- '''Advanced cultivation of Kharif / Kharif Kharif / Rabi foodgrains Paddy is the
major crop of the state in Kharif. It is the largest area sown / sown and has
great potential to increase productivity |यह. To achieve higher rice yields, the
following factors must be taken into consideration: |2. Select the recommended
varieties of paddy according to local conditions such as regional climate, soil,
irrigation facilities, water logging, and suitability for sowing and transplanting.
|मृदा Sow pure, certified and researched seeds |मृदा On a trial basis, timely
and recommended quantities of balanced fertilizers, organic manure, and green
manure. Make good use of the available irrigation potential by timely sowing /
transplanting. The number of plants per unit area should be ensured. |कीट Disease
and weed control should be done. |कम The ratio of fertilizers should be kept 2:
4: 4 even in the case of fertilizer availability. |4 Preparation of the field
should be done by ploughing 2 - 3 after ploughing the land. At the same time,
the farm should be made strong so that rainwater can be stored in the field for
a long time. If green manure is being taken then phosphorus should be used along
with its sowing. Irrigate the field a week before sowing / transplanting paddy
so that weeds grow. Volume per hectare 60-75 kg. Mix rotten cow dung manure, sprinkle
with lukewarm water and leave it in shade for 8-0 days, then add it to the fields
at the time of last mowing to protect them from pests such as termites, white
weeds, nematodes, root bugs, cutworms, etc. Volume per hectare 60-75 kg. After
sprinkling light water mixed with cow dung manure and keeping it in shade for
8-40 days, the land should be tilled at the last ploughing before sowing. P749AF
#े 3. Rice cultivation in the region is done by direct sowing and transplanting
in non-irrigated and irrigated conditions. The recommended varieties of paddy
for different climatic zones and conditions of the state are mentioned in Table-4.
The qualities and characteristics of the main varieties are also listed in Table-2.
04'''
model-index:
- name: SentenceTransformer based on BAAI/bge-small-en-v1.5
results:
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: val evaluator
type: val_evaluator
metrics:
- type: cosine_accuracy@1
value: 0.45012787723785164
name: Cosine Accuracy@1
- type: cosine_accuracy@5
value: 0.8580562659846548
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.9207161125319693
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.45012787723785164
name: Cosine Precision@1
- type: cosine_precision@5
value: 0.17161125319693094
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.09207161125319692
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.45012787723785164
name: Cosine Recall@1
- type: cosine_recall@5
value: 0.8580562659846548
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.9207161125319693
name: Cosine Recall@10
- type: cosine_ndcg@5
value: 0.6776887809935845
name: Cosine Ndcg@5
- type: cosine_ndcg@10
value: 0.6982045153363013
name: Cosine Ndcg@10
- type: cosine_ndcg@100
value: 0.7149326391576375
name: Cosine Ndcg@100
- type: cosine_mrr@5
value: 0.6166240409207153
name: Cosine Mrr@5
- type: cosine_mrr@10
value: 0.6252430682417891
name: Cosine Mrr@10
- type: cosine_mrr@100
value: 0.6289243546015818
name: Cosine Mrr@100
- type: cosine_map@100
value: 0.6289243546015826
name: Cosine Map@100
- type: dot_accuracy@1
value: 0.4514066496163683
name: Dot Accuracy@1
- type: dot_accuracy@5
value: 0.8580562659846548
name: Dot Accuracy@5
- type: dot_accuracy@10
value: 0.9207161125319693
name: Dot Accuracy@10
- type: dot_precision@1
value: 0.4514066496163683
name: Dot Precision@1
- type: dot_precision@5
value: 0.17161125319693094
name: Dot Precision@5
- type: dot_precision@10
value: 0.09207161125319692
name: Dot Precision@10
- type: dot_recall@1
value: 0.4514066496163683
name: Dot Recall@1
- type: dot_recall@5
value: 0.8580562659846548
name: Dot Recall@5
- type: dot_recall@10
value: 0.9207161125319693
name: Dot Recall@10
- type: dot_ndcg@5
value: 0.6781607378304497
name: Dot Ndcg@5
- type: dot_ndcg@10
value: 0.6986764721731665
name: Dot Ndcg@10
- type: dot_ndcg@100
value: 0.7154045959945029
name: Dot Ndcg@100
- type: dot_mrr@5
value: 0.6172634271099737
name: Dot Mrr@5
- type: dot_mrr@10
value: 0.6258824544310474
name: Dot Mrr@10
- type: dot_mrr@100
value: 0.6295637407908401
name: Dot Mrr@100
- type: dot_map@100
value: 0.6295637407908409
name: Dot Map@100
---
# SentenceTransformer based on BAAI/bge-small-en-v1.5
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [BAAI/bge-small-en-v1.5](https://huggingface.co/BAAI/bge-small-en-v1.5)
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 384 tokens
- **Similarity Function:** Cosine Similarity
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("SamagraDataGov/embedding_finetuned")
# Run inference
sentences = [
'What is the purpose of heating the fresh fruit tissue in alcohol or HCl?',
"'The fresh fruit tissue or separated parts, including the peel and core are heated in 95% alcohol or 0.05N HCl (pH 2.0) for 10-20 min at 70 o C to inactivate pectic enzymes. After the pretreatment, the materials is ground in an electric blender and placed in water. Versene or Na-EDTA is added at 2.0%. The pH is adjusted to 6.0. The mixture is heated for about an hour at 90-95 o C. The slurry formed is rapidly filtered and the pectin is precipitated from the solution using acidified alcohol. The precipitate is centrifuged and repeatedly washed with 70% alcohol. Acetone is used for dehydration and the pectin produced is vacuum-dried. It may also be dried in a hot-air oven at 50 o C for 4 h.'",
"'Advanced cultivation of Kharif / Kharif Kharif / Rabi foodgrains Paddy is the major crop of the state in Kharif. It is the largest area sown / sown and has great potential to increase productivity |यह. To achieve higher rice yields, the following factors must be taken into consideration: |2. Select the recommended varieties of paddy according to local conditions such as regional climate, soil, irrigation facilities, water logging, and suitability for sowing and transplanting. |मृदा Sow pure, certified and researched seeds |मृदा On a trial basis, timely and recommended quantities of balanced fertilizers, organic manure, and green manure. Make good use of the available irrigation potential by timely sowing / transplanting. The number of plants per unit area should be ensured. |कीट Disease and weed control should be done. |कम The ratio of fertilizers should be kept 2: 4: 4 even in the case of fertilizer availability. |4 Preparation of the field should be done by ploughing 2 - 3 after ploughing the land. At the same time, the farm should be made strong so that rainwater can be stored in the field for a long time. If green manure is being taken then phosphorus should be used along with its sowing. Irrigate the field a week before sowing / transplanting paddy so that weeds grow. Volume per hectare 60-75 kg. Mix rotten cow dung manure, sprinkle with lukewarm water and leave it in shade for 8-0 days, then add it to the fields at the time of last mowing to protect them from pests such as termites, white weeds, nematodes, root bugs, cutworms, etc. Volume per hectare 60-75 kg. After sprinkling light water mixed with cow dung manure and keeping it in shade for 8-40 days, the land should be tilled at the last ploughing before sowing. P749AF #े 3. Rice cultivation in the region is done by direct sowing and transplanting in non-irrigated and irrigated conditions. The recommended varieties of paddy for different climatic zones and conditions of the state are mentioned in Table-4. The qualities and characteristics of the main varieties are also listed in Table-2. 04'",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
## Evaluation
### Metrics
#### Information Retrieval
* Dataset: `val_evaluator`
* Evaluated with [InformationRetrievalEvaluator
](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
| Metric | Value |
|:--------------------|:-----------|
| cosine_accuracy@1 | 0.4501 |
| cosine_accuracy@5 | 0.8581 |
| cosine_accuracy@10 | 0.9207 |
| cosine_precision@1 | 0.4501 |
| cosine_precision@5 | 0.1716 |
| cosine_precision@10 | 0.0921 |
| cosine_recall@1 | 0.4501 |
| cosine_recall@5 | 0.8581 |
| cosine_recall@10 | 0.9207 |
| cosine_ndcg@5 | 0.6777 |
| cosine_ndcg@10 | 0.6982 |
| cosine_ndcg@100 | 0.7149 |
| cosine_mrr@5 | 0.6166 |
| cosine_mrr@10 | 0.6252 |
| cosine_mrr@100 | 0.6289 |
| cosine_map@100 | 0.6289 |
| dot_accuracy@1 | 0.4514 |
| dot_accuracy@5 | 0.8581 |
| dot_accuracy@10 | 0.9207 |
| dot_precision@1 | 0.4514 |
| dot_precision@5 | 0.1716 |
| dot_precision@10 | 0.0921 |
| dot_recall@1 | 0.4514 |
| dot_recall@5 | 0.8581 |
| dot_recall@10 | 0.9207 |
| dot_ndcg@5 | 0.6782 |
| dot_ndcg@10 | 0.6987 |
| dot_ndcg@100 | 0.7154 |
| dot_mrr@5 | 0.6173 |
| dot_mrr@10 | 0.6259 |
| dot_mrr@100 | 0.6296 |
| **dot_map@100** | **0.6296** |
## Training Details
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `gradient_accumulation_steps`: 4
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `num_train_epochs`: 1.0
- `warmup_ratio`: 0.1
- `load_best_model_at_end`: True
#### All Hyperparameters
Click to expand
- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 8
- `per_device_eval_batch_size`: 8
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 4
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 1.0
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: False
- `fp16`: False
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: True
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: False
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`:
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `eval_use_gather_object`: False
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: proportional
### Training Logs
| Epoch | Step | Training Loss | loss | val_evaluator_dot_map@100 |
|:----------:|:-------:|:-------------:|:----------:|:-------------------------:|
| 0.0682 | 15 | 0.5269 | 0.3693 | 0.6033 |
| 0.1364 | 30 | 0.2825 | 0.2129 | 0.6057 |
| 0.2045 | 45 | 0.3093 | 0.1710 | 0.6080 |
| 0.2727 | 60 | 0.1677 | 0.1486 | 0.6196 |
| 0.3409 | 75 | 0.2368 | 0.1256 | 0.6199 |
| 0.4091 | 90 | 0.161 | 0.1113 | 0.6255 |
| 0.4773 | 105 | 0.1452 | 0.1006 | 0.6256 |
| 0.5455 | 120 | 0.1323 | 0.1008 | 0.6266 |
| 0.6136 | 135 | 0.1138 | 0.0986 | 0.6270 |
| 0.6818 | 150 | 0.1129 | 0.0954 | 0.6289 |
| 0.75 | 165 | 0.1322 | 0.0914 | 0.6290 |
| 0.8182 | 180 | 0.2063 | 0.0898 | 0.6307 |
| 0.8864 | 195 | 0.1055 | 0.0891 | 0.6300 |
| **0.9545** | **210** | **0.0931** | **0.0888** | **0.6296** |
| 1.0 | 220 | - | 0.0888 | 0.6296 |
* The bold row denotes the saved checkpoint.
### Framework Versions
- Python: 3.10.14
- Sentence Transformers: 3.0.1
- Transformers: 4.43.4
- PyTorch: 2.4.1+cu121
- Accelerate: 0.33.0
- Datasets: 2.21.0
- Tokenizers: 0.19.1
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
```
#### GISTEmbedLoss
```bibtex
@misc{solatorio2024gistembed,
title={GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning},
author={Aivin V. Solatorio},
year={2024},
eprint={2402.16829},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
```