embedding-finetuned / README.md
smokxy's picture
pytorch_model.bin upload/update
fe85a47 verified
metadata
language: []
library_name: sentence-transformers
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - generated_from_trainer
  - dataset_size:900
  - loss:GISTEmbedLoss
base_model: BAAI/bge-small-en-v1.5
datasets: []
metrics:
  - cosine_accuracy@1
  - cosine_accuracy@5
  - cosine_accuracy@10
  - cosine_precision@1
  - cosine_precision@5
  - cosine_precision@10
  - cosine_recall@1
  - cosine_recall@5
  - cosine_recall@10
  - cosine_ndcg@5
  - cosine_ndcg@10
  - cosine_ndcg@100
  - cosine_mrr@5
  - cosine_mrr@10
  - cosine_mrr@100
  - cosine_map@100
  - dot_accuracy@1
  - dot_accuracy@5
  - dot_accuracy@10
  - dot_precision@1
  - dot_precision@5
  - dot_precision@10
  - dot_recall@1
  - dot_recall@5
  - dot_recall@10
  - dot_ndcg@5
  - dot_ndcg@10
  - dot_ndcg@100
  - dot_mrr@5
  - dot_mrr@10
  - dot_mrr@100
  - dot_map@100
widget:
  - source_sentence: Who are the CSCs engaged to enrol non-loanee farmers?
    sentences:
      - >-
        'notification or /and on National Crop Insurance Portal multiplied by
        sown area for notified crop.    3.1.3   Special efforts shall be made to
        ensure maximum coverage of SC/ ST/ Women farmers under the    Scheme.
        Further Panchayat Raj Institutions (PRIs) may be involved  in extension
        and awareness    creation amongst farmers and obtaining feed-back of
        farmers about the implementation of the    Scheme   3.1.4   The
        implementing Insurance Company selected as L1 will be responsible for
        taking necessary  measures to ensure at least 10% incremental increase
        in coverage of non-loanee farmers. However  other empanelled Insurance
        Companies which have participated in the bidding and are keen for 
        enrolment of non loanee farmers in the cluster may also be allowed to
        enrol non-loanee farmers at L1 premium rate. The interested companies
        have to inform their willingness in writing within seven days of
        finalisation of tender/issuance of work order to L1. It will however be
        the responsibility of all the  Insurance Companies engaged in this
        process to ensure that duplicate enrolment does not happen in the given
        cluster/district. Engaging companies other than L1 for enrolling non
        loanee farmers will be taken up on a pilot basis in Districts notified
        by State Govt.  They shall enrol non loanee farmers as per  conditions
        laid down in Para 17.5.   3.1.5   These Insurance Company will maintain
        separate data of such non loanee farmers covered by them and enter the
        said data on the portal as per seasonality discipline detailed in Para
        16.2. They shall be  liable for payment of claims to such farmers. 
        3.1.6   The exchange of information, co-witnessing of CCEs and sharing
        of yield data etc for the cluster by  Government/NCIP will be limited to
        L1 Company only and it will be binding on other companies to  accept it.
        However, the requisition for payment of Government subsidy in respect of
        non-loanee  enrolled by them will be submitted directly to the Govt
        designated agency.'
      - >-
        'Name of Implementing Agency
        (NABARD/NCDC):............................................. Address:
        ...........................................................................................................
        ...........................................................................................................
        ................................................................................................................. 
        Phone Number:
        .............................................................................   
        (Each page of the application form should be signed by Branch head and
        Zonal Manager)    Name and Address of the applicant Bank Branch :    1
        a) Complete Postal Address (*with pin-code) :    1 b) Phone No. with STD
        :    1 c)  Fax No.:    1 d) E-Mail Address:      1 e) Details of the
        authorised  Designation  Mobile No.  E-Mail Address.  person of the Bank
        submitting the Claim:  2  Name of Borrower FPO :  2 a) Constitution: 
        Producer Organization  2 b) Registered Office Address (*with pin-code): 
        (i). Phone No.  (ii). Fax No.  (iii). E-mail Address  2 c)  Business
        Office Address (if any)  (i). Phone No.  (ii). Fax No.  (iii). E-mail
        Address  2 d) Name of CEO :  Mobile No.  2 e) Credit Facility for which
        guarantee cover sought :    Old  New  Expansion  Technical Upgradation 
        2 f ) Give details of components:-   Inputs:    Processing: 
        Marketing:    Any other:  Total Investment:  3  Banking Facilities
        Sanctioned by sanctioning authority (Rs. in Lakh):-      (i). Term-Loan
        :  Date of Sanction: Amount  Outstanding:  IRAC Status:  IRAC
        Status:     (ii).Cash Credit :  Date of Sanction: Amount  Outstanding: 
        3 a)  Sanctioning Office:  Branch:  ZO / RO:  HO:    3 b)  Designation
        of Sanctioning Authority :    3 c)  Sanctioning authority approval vide
        :    3 d)  Sanction / Appraisal Note No.  Dated:    3 e)  Agenda No. /
        Minutes conveying sanction :    4  Name and Address of Controlling
        Office of the Branch (*with pin-code):    4.a).  Name of Controlling
        Authority :    4.b).  Mobile No.:    4.c).  Fax. No. :    4.d).  E-Mail
        Address. :    5  Present status of FPO Activity : (Give component wise
        details)    5. a)   5. b).   5. c).   5. d).   5. e).   5. f )   6 
        Status of Accounts  6. a). Term-Loan:  Amount of Disbursement till date
        :  Outstanding as on date :      i).'
      - >-
        '8.1    CSCs under Ministry of Electronics and Information Technology
        (MeITY) have been engaged to enrol    non-loanee farmers. The Insurance
        Companies are required to enter into a separate agreement with    CSC
        and pay service charges as fixed by DAC&FW, GOI per farmer per village
        per season. No other    agreement or payment is required to be made for
        this purpose. Nodal agency for engagement with    Ministry of
        Agriculture and Farmers Welfare and Insurance Companies will be CSC-SPV,
        a company    established under MeITY for carrying out e-governance
        initiatives of GoI.  8.2    No charges/fee shall be borne or paid by the
        farmers being enrolled through CSCs i.e. CSC-SPV and    CSC-VLE  8.3   
        As per IRDA circular, no separate qualification/certification will be
        required for the VLEs of CSCs to    facilitate enrolment of non-loanee
        farmers.  8.4    All empanelled Insurance Companies will compulsorily be
        required to enter into an agreement with    CSC for enrolment of
        non-loanee farmers and for provision of other defined services to
        farmers.   8.5    Other designated intermediaries may be linked with the
        Portal in due course.   8.6    Empanelled Insurance Companies have to
        necessarily register on the portal and submit list and details    of
        agents/intermediaries engaged for enrolment of non-loanee farmers in the
        beginning of each    season  within 10 days of award of work in the
        State.  Further all agents/intermediaries have to work    strictly as
        per the provisions of the Scheme and IRDA regulations'
  - source_sentence: What role does the N-PMAFSC play in the modification of the scheme?
    sentences:
      - >-
        'Eligible FPOs shall apply for the Equity Grant in the prescribed
        Application Form  (**Annexure-I**) only. Other mandatory documents
        required to be submitted along with the Application are listed below:  
        (i) Shareholder List and Share Capital contribution by each member
        verified and  certified by a Chartered Accountant (CA)/Co-operative
        Auditors prior to  submission(Enclosure-I of Annexure-I).  (ii)
        Resolution of the Board of Directors/Governing Body to seek Equity Grant
        for  members **(Enclosure-II of Annexure-I).**  (iii) Consent of
        shareholders, stating name of shareholder, gender, number of  shares
        held, face value of shares, land holding, signifying consent for
        Implementing Agencies to directly transfer the Equity Grant sanctioned
        to the FPO on their behalf, to FPO Bank account, against the
        consideration of additional shares of equivalent value to be issued to
        them by FPO and on  exit- transfer of the shares as per rules
        **(Enclosure-III of Annexure-I).**  (iv) If the FPO is in operation for
        more than one or more financial year then it  shall provide copy of the
        Audited Financial Statements of FPO for all years of existence of the
        FPO, verified and certified by a Chartered Accountant (CA)/ Cooperative
        Auditors prior to submission.  (v) In case FPO is in operation for
        period of less than one financial year,  Photocopy of Bank Account
        Statement for last six months authenticated by the Branch Manager of the
        \'Bank\' is required  (vi) Business Plan of FPO and budget for next 18
        months. (vii) Names, photographs, and identity proof (anyone from among
        ration card,  Aadhaar card, election identification card, passport) of
        Representatives/ Directors authorized by the Board for executing and
        signing all documents under the Scheme. Each page of the Application
        Form and accompanying documents shall be signed by a minimum of two
        Board Member /Authorized Representatives of the FPO.'
      - >-
        '20.3.1   For addressing the issue of reliability of CCEs in terms of
        their accuracy, representativeness and  timeliness, innovative
        technologies such as satellite remote sensing, drone, modeling,
        AWS/ARG,  real  time transmission of data etc. should be utilized. This
        will ensure accurate assessment of yield and  timely payment of claims
        to farmers. Various studies carried out by national and international
        organizations, including MNCFC, NRSC, SAC, CCAFS, IRRI, IFPRI, World
        Bank, etc. have shown that the  use of satellite, weather, soil and crop
        data, along with images/video capture of crop growth at various stages
        and accurate sample CCE data collection can improve the yield data
        quality/ timeliness and support timely claim processing and payments.  
        20.3.2   States, with the support of national centres as mentioned
        above, SRSC and SAUs, need to carry out  adequate number of pilot
        studies for improved yield estimation using technology, as mentioned 
        above, and small number of good quality CCEs. When a significant
        correlation is observed between remote sensing and weather estimated
        yield and yield estimated through CCEs, States and Insurance Companies
        can use these technologies in estimating the crop yields at IU level,
        subject to the satisfaction of both States and Insurance Companies about
        the accuracy of the yield estimates, to service the claims.'
      - >-
        '(i)  Coordinate with all the Implementing Agencies, State Level
        Consultative Committee and District level Monitoring Committee (D-MC)
        for smooth implementation. It will also consider feedback received from
        other relevant Ministries and Organizations on Clusters identification
        for consideration.  (ii)  It will monitor the progress either by holding
        the meetings of Implementing  Agencies and other stakeholders or by
        other means.  (iii) It will allocate the produce
        clusters/districts/States to Implementing  Agencies for formation and
        promotion of FPOs.  (iv) It will undertake scrutiny of Action Plan of
        Implementing Agencies(IAs),  consider recommendation of release of fund
        to Implementing Agencies based on previous utilization as due   with
        respect to funding under the  Scheme.  (v)  It will provide policy
        inputs to DAC&FW for modification in the Scheme to  better suit in the
        formation and promotion of FPOs to make them economically sustainable. 
        (vi) It will provide aid and advice to Implementing Agencies as may be
        required  for smooth functioning of the scheme.  (vii) Based on
        suggestions received from various Implementing Agencies, other 
        Ministries, States and experience/need, N-PMAFSC may examine and 
        recommend revision of the minimum membership norm per FPO to DAC&FW. 
        (viii) It may seek detailed input and analysis as may be required from
        time to  time from NPMA and also seek assistance of DMI in verification
        etc.'
  - source_sentence: When should non-loanee farmers apply for the Rabi season?
    sentences:
      - >-
        ' Date………………………………   ……………………………… Signature of Branch Manager with
        branch seal  Name…………………………………… … Designation ……………………………………
        ………………………………  ……………………………… Signature of Authorized Person in zonal
        office Name………………………………… Designation ……………………………………  5. Promoter's
        request letter  List of Enclosures  1. Recommendation  9. List of
        shareholders  addressed to the Bank Manager on original letter head of
        FPO  confirmed by promoter and bank  with amount of CGC  sought on
        Bank's  Original letterhead with date and dispatch number duly signed by
        the Branch Manager on each page.  2. Sanction letter of  6.
        Implementation Schedule  10. Affidavit of promoters that  confirmed by
        the bank.  they have not availed CGC  from any other institution for 
        sanctioned Credit Facility.  sanctioning authority  addressed to
        recommending  branch.  3. Bank's approved  7. Up-to-date statement of
        account of  11. Field inspection report of  Term loan and Cash Credit
        (if Sanctioned).  Bank official as on recent date.  Appraisal/Process
        note bearing signature of sanctioning authority.  4. Potential Impact
        on  8. a).Equity Certificate, C.A/CS  * Pin Code at Column No. 1. a), 
        certificate/RCS certificate  2. b), 2. c), 4. a) and 9. a) is Mandatory 
        b). FORM-2, FORM-5 and FORM-23  filed with ROC for Company/RCS.  small
        farmer producers  1. Social Impact,  2. Environmental  Impact  3.'
      - >-
        ' \'Tenure of Guarantee Cover\' means the agreed tenure of the Term
        loan/  composite credit i.e. the maximum period of Guarantee Cover from
        the Guarantee start-up which shall run through the agreed tenure of the
        term credit, and where working capital facilities or Term loan alone are
        extended  and/or  continuing working capital arrangements granted along
        with the Term Loan, for a period of 5  years  or block of   5 years
        and/or loan / working capital credit or  composite credit facilities'
        termination date, whichever is earlier or such period  as may be
        specified by the NABARD or NCDC, as case may be.'
      - >-
        'for loanee  and within 30 days for non loanee i.e. 15th Aug  for Kharif
        and 15th  Jan for Rabi for loanee and  31st Aug for Kharif and 31st  Jan
        for Rabi for  Non Loanee  13  Within  7 days  from the date of
        intimation by ICs  CSCs/Banks/ Intermediary  Cut-off date for
        CSCs/Banks/Intermediary to correct/update the  paid application
        intimated by ICs on Crop Insurance Portal  14  Cut-off date for Insurer
        to accept the corrected/updated applications  Within  7 days  from the
        date of submission of correction/updation by the Bank/CSC   Insurance
        Companies  15  Within 7 days from acceptance of proposal by concerned
        Insurance Company on Portal  Cut-off date for Banks/ICs to hand over
        insurance acknowledgement receipt along with folio to the insured
        farmer  Banks/ICs for enrolment through their intermediaries  16  Cut
        off date for processing of applications by ICs and auto approval of
        application of insured farmers on crop insurance Portal  60 days from
        the cut off date for enrolment/debit of premium from farmers i.e.  15th
        September  for Kharif and 15th February for  Rabi seasons  17  Before
        cut off date of enrolment of farmers  Insurance Companies/GOI /State  
        Cut off date for raising bills/requisitions with supporting documents
        for releasing of advance premium subsidy based on 50% of  80% of
        respective share of Centre/State in corresponding previous season  18 
        Release of advance upfront premium subsidy  (First Instalment)i.e. 50%
        of 80% of respective share of Centre/State in corresponding  previous
        season  Within 15days of cut off date of enrolment of farmers i.e.
        31st   July for Kharif   Upto  15th August*  19  *state may fix earlier
        dates  for early Kharif crops  Training and registration of field level
        workers assigned for conduct of CCEs and reporting of  the same on crop
        insurance Portal through smart phones/CCE Agri App  Upto31st August*
        *state  20  Registration of mobile number of representative of ICs for
        co-witnessing of CCEs  may fix earlier dates for early  Kharif crops  At
        least 7 days before tentative date  for conducting CCEs  21  a)
        Uploading of tentative schedule/date for conducting CCEs (crop-wise/IU
        wise) followed  by SMS on one day notice through CCEs app.'
  - source_sentence: >-
      What is the requirement for the shareholder list and share capital
      contribution?
    sentences:
      - >-
        'To substantiate the fact, the most successful example is of dairy
        co-operative in India where professional managers have contributed
        immensely to make it a success. There are other so many examples which
        prove the absolute requirement of professional managers. The number of
        professional staff could depend on geographical spread of business
        operation, diversity of activities and volume of business. However, an
        FPO should have minimum a CEO/Manager and an Accountant. Accountant is
        required in FPO to look after its day to day accounting work. Based on
        requirement, FPO can engage other staff also.   10.3 The CEO/Manager is
        to be appointed by the executive body of the FPO who  should be either
        graduate in agriculture / agriculture marketing / agri-business
        management or BBA or equivalent. Locally available professionals with
        10+2 and  preferably diploma in agriculture / agriculture marketing /
        agri-business management or in such other related areas may be
        preferable. The accountant should have educational qualification of 10+2
        with Mathematics as a compulsory subject or alternatively with  Commerce
        or Accountancy background. If any members of the FPO meet the above
        criteria, they may be considered preferably in the selection process. 
        10.4 Under the scheme, financial support towards salary of CEO/Manager
        up to         @ Rs. 25,000/- per month and of Accountant up to @
        Rs.10,000/- per month with annual increment up to 5% is to be provided
        from the earmarked financial support for first 3 years only. Thereafter,
        FPOs will manage from their own resources to pay the salary of
        CEO/Manager and Accountant. In order to create  interest of good
        professional activities of CEO/Accountant, the FPO may also offer higher
        payment with their own sources of funds on above of Govt. support. One
        CEO will provide full time services to one FPO at a time only.'
      - >-
        'i. Shareholder List and Share Capital contribution by each Member
        verified and certified by a Chartered Accountant (CA) prior to
        submission (Format attached, Annexure I- Enclosure-I). ii. Resolution of
        FPO Board/Governing Council to seek Equity Grant for Members (Format
        attached, Annexure I- Enclosure-II).  iii. Consent of Shareholders,
        stating name of shareholder, gender, number of shares held, face value
        of shares, land holding, and signature, signifying consent for
        Implementing Agency to directly transfer the Equity Grant sanctioned to
        the FPC on their behalf, to FPC Bank account, against the consideration
        of additional shares of equivalent value to be issued to them by FPC and
        on exit- transfer of the shares as per rules (Format attached, Annexure
        I-Enclosure-III).   iv. Audited Financials of FPO for a minimum 1
        year/for all years of existence of the FPO if formed less than three
        years prior to application/ for the last 3 years for FPO in existence
        for 3 years or more, verified and certified by a Chartered Accountant
        (CA) prior to submission. v. Photocopy of FPO Bank Account Statement for
        last six months authenticated by Branch Manager. vi. Business plan and
        budget for next 18 months. vii. Names, photographs, and identity proof
        (one from among ration card, Aadhaar card, election identification card,
        and passport of Representatives/ Directors authorized by the Board for
        executing and signing all documents under the Scheme. viii. Each page of
        Application Form   and accompanying documents should be signed by a
        minimum of two Board Member Authorised Representatives of FPO;'
      - >-
        '9.1 The Formation and Incubation cost of CBBO, limited to maximum of 
        Rs. 25  lakh / FPO  of support or actual  which is lesser, is to be
        provided for five years from the year of formation. It includes cost
        towards undertaking baseline survey, mobilization of farmers, organizing
        awareness programmes and conducting exposure visits, professional hand
        holdings, incubation, cost of engaging CBBOs and other overheads etc.
        There is also a provision for cost of NPMA towards   manpower,
        establishment, travel and advisory and  maintaining MIS portal. This
        also includes a provision towards cost for development of appropriate
        overall ICT based MIS web portal for the Scheme.'
  - source_sentence: What does the term 'shareholder members' refer to?
    sentences:
      - >-
        '(i)  Coordinate with all the Implementing Agencies, State Level
        Consultative Committee and District level Monitoring Committee (D-MC)
        for smooth implementation. It will also consider feedback received from
        other relevant Ministries and Organizations on Clusters identification
        for consideration.  (ii)  It will monitor the progress either by holding
        the meetings of Implementing  Agencies and other stakeholders or by
        other means.  (iii) It will allocate the produce
        clusters/districts/States to Implementing  Agencies for formation and
        promotion of FPOs.  (iv) It will undertake scrutiny of Action Plan of
        Implementing Agencies(IAs),  consider recommendation of release of fund
        to Implementing Agencies based on previous utilization as due   with
        respect to funding under the  Scheme.  (v)  It will provide policy
        inputs to DAC&FW for modification in the Scheme to  better suit in the
        formation and promotion of FPOs to make them economically sustainable. 
        (vi) It will provide aid and advice to Implementing Agencies as may be
        required  for smooth functioning of the scheme.  (vii) Based on
        suggestions received from various Implementing Agencies, other 
        Ministries, States and experience/need, N-PMAFSC may examine and 
        recommend revision of the minimum membership norm per FPO to DAC&FW. 
        (viii) It may seek detailed input and analysis as may be required from
        time to  time from NPMA and also seek assistance of DMI in verification
        etc.'
      - >-
        '19.1   It has been seen, during first two years of implementation of
        PMFBY, there are various types of yield disputes, which unnecessarily
        delays the claim settlement. Following figure shows the procedures to 
        be adopted in various cases.    Figure. Procedures to be followed in
        different yield dispute cases     19.2   Wherever the yield estimates
        reported at IU level are abnormally low or high vis-à-vis the general
        crop  condition the Insurance Company in consultation with State Govt.
        can make use of various products (e.g. Satellite based Vegetation Index,
        Weather parameters, etc.) or other technologies (including  statistical
        test, crop models etc.) to confirm yield estimates. If Insurance Company
        witnesses any  anomaly/deficiency in the actual yield data(partial
        /consolidated) received from the State Govt., the  same shall be brought
        into the notice of concerned State department within 7 days from date of
        receipt of yield data with specific observations/remarks under
        intimation to Govt. of India and anomaly, if any, may be resolved  in
        next 7 days by the  State Level Coordination Committee (SLCC)  headed by
        Additional Chief Secretary/Principal Secretary/Secretary of the
        concerned department. This committee shall be authorized to decide all
        such cases and the decision in such cases shall be final. The SLCC may
        refer the case to State Level Technical Advisory Committee (STAC) for
        dispute resolution (Constitution of STAC is defined in Para 19.5). In
        case the matter stands unresolved even after examination by STAC, it may
        be escalated to TAC along with all relevant documents including minutes
        of meetings/records of discussion and report of the STAC and SLCC.
        Reference to TAC can be made thereafter only in conditions specified in
        Para 19.7.1 However, data with anomalies which is not reported within 7
        days will be treated as accepted to insurance company.'
      - >-
        'Date:  To, (i) The Managing Director Small Farmers' Agri-Business
        Consortium (SFAC), NCUI Auditorium, August Kranti Marg, Hauz Khas, New
        Delhi 110016. (ii)The Managing Director National Co-operative
        Development Corporation (NCDC), 4, Siri Institutional Area, Hauz Khas,
        New Delhi 110016. (iii) The Chief General Manager National Bank for
        Agriculture and Rural Development (NABARD), Regional Office
        --------------------------------------------------------------- (iv) To
        any other additional Implementing Agency allowed/designated, as the case
        may be. Sub: Application for Equity Grant under scheme of Formation and
        Promotion of 10,000  Farmer Producer Organizations (FPOs)  Dear
        Sir/Madam, We herewith apply for Equity Grant as per the provisions
        under the captioned scheme.  1. The details of the FPO are as under-  
        S. No.  Particulars to be furnished  Details  1.   Name of the FPO  2.  
        Correspondence address of FPO  3.   Contact details of FPO  4.  
        Registration Number  5.   Date of registration/incorporation of FPO 
        6.   Brief account of business of FPO  7.   Number of Shareholder
        Members  8.    Number of Small, Marginal and Landless Shareholder
        Members'
pipeline_tag: sentence-similarity
model-index:
  - name: SentenceTransformer based on BAAI/bge-small-en-v1.5
    results:
      - task:
          type: information-retrieval
          name: Information Retrieval
        dataset:
          name: val evaluator
          type: val_evaluator
        metrics:
          - type: cosine_accuracy@1
            value: 0.43
            name: Cosine Accuracy@1
          - type: cosine_accuracy@5
            value: 0.87
            name: Cosine Accuracy@5
          - type: cosine_accuracy@10
            value: 0.92
            name: Cosine Accuracy@10
          - type: cosine_precision@1
            value: 0.43
            name: Cosine Precision@1
          - type: cosine_precision@5
            value: 0.17399999999999996
            name: Cosine Precision@5
          - type: cosine_precision@10
            value: 0.09199999999999997
            name: Cosine Precision@10
          - type: cosine_recall@1
            value: 0.43
            name: Cosine Recall@1
          - type: cosine_recall@5
            value: 0.87
            name: Cosine Recall@5
          - type: cosine_recall@10
            value: 0.92
            name: Cosine Recall@10
          - type: cosine_ndcg@5
            value: 0.6778743824685509
            name: Cosine Ndcg@5
          - type: cosine_ndcg@10
            value: 0.6934417324625011
            name: Cosine Ndcg@10
          - type: cosine_ndcg@100
            value: 0.712214063928892
            name: Cosine Ndcg@100
          - type: cosine_mrr@5
            value: 0.6126666666666668
            name: Cosine Mrr@5
          - type: cosine_mrr@10
            value: 0.618761904761905
            name: Cosine Mrr@10
          - type: cosine_mrr@100
            value: 0.623424850876552
            name: Cosine Mrr@100
          - type: cosine_map@100
            value: 0.6234248508765518
            name: Cosine Map@100
          - type: dot_accuracy@1
            value: 0.43
            name: Dot Accuracy@1
          - type: dot_accuracy@5
            value: 0.87
            name: Dot Accuracy@5
          - type: dot_accuracy@10
            value: 0.92
            name: Dot Accuracy@10
          - type: dot_precision@1
            value: 0.43
            name: Dot Precision@1
          - type: dot_precision@5
            value: 0.17399999999999996
            name: Dot Precision@5
          - type: dot_precision@10
            value: 0.09199999999999997
            name: Dot Precision@10
          - type: dot_recall@1
            value: 0.43
            name: Dot Recall@1
          - type: dot_recall@5
            value: 0.87
            name: Dot Recall@5
          - type: dot_recall@10
            value: 0.92
            name: Dot Recall@10
          - type: dot_ndcg@5
            value: 0.6778743824685509
            name: Dot Ndcg@5
          - type: dot_ndcg@10
            value: 0.6934417324625011
            name: Dot Ndcg@10
          - type: dot_ndcg@100
            value: 0.712214063928892
            name: Dot Ndcg@100
          - type: dot_mrr@5
            value: 0.6126666666666668
            name: Dot Mrr@5
          - type: dot_mrr@10
            value: 0.618761904761905
            name: Dot Mrr@10
          - type: dot_mrr@100
            value: 0.623424850876552
            name: Dot Mrr@100
          - type: dot_map@100
            value: 0.6234248508765518
            name: Dot Map@100

SentenceTransformer based on BAAI/bge-small-en-v1.5

This is a sentence-transformers model finetuned from BAAI/bge-small-en-v1.5. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: BAAI/bge-small-en-v1.5
  • Maximum Sequence Length: 512 tokens
  • Output Dimensionality: 384 tokens
  • Similarity Function: Cosine Similarity

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("smokxy/embedding-finetuned")
# Run inference
sentences = [
    "What does the term 'shareholder members' refer to?",
    "'Date:  To, (i) The Managing Director Small Farmers' Agri-Business Consortium (SFAC), NCUI Auditorium, August Kranti Marg, Hauz Khas, New Delhi 110016. (ii)The Managing Director National Co-operative Development Corporation (NCDC), 4, Siri Institutional Area, Hauz Khas, New Delhi 110016. (iii) The Chief General Manager National Bank for Agriculture and Rural Development (NABARD), Regional Office --------------------------------------------------------------- (iv) To any other additional Implementing Agency allowed/designated, as the case may be. Sub: Application for Equity Grant under scheme of Formation and Promotion of 10,000  Farmer Producer Organizations (FPOs)  Dear Sir/Madam, We herewith apply for Equity Grant as per the provisions under the captioned scheme.  1. The details of the FPO are as under-   S. No.  Particulars to be furnished  Details  1.   Name of the FPO  2.   Correspondence address of FPO  3.   Contact details of FPO  4.   Registration Number  5.   Date of registration/incorporation of FPO  6.   Brief account of business of FPO  7.   Number of Shareholder Members  8.    Number of Small, Marginal and Landless Shareholder Members'",
    "'19.1   It has been seen, during first two years of implementation of PMFBY, there are various types of yield disputes, which unnecessarily delays the claim settlement. Following figure shows the procedures to  be adopted in various cases.    Figure. Procedures to be followed in different yield dispute cases     19.2   Wherever the yield estimates reported at IU level are abnormally low or high vis-à-vis the general crop  condition the Insurance Company in consultation with State Govt. can make use of various products (e.g. Satellite based Vegetation Index, Weather parameters, etc.) or other technologies (including  statistical test, crop models etc.) to confirm yield estimates. If Insurance Company witnesses any  anomaly/deficiency in the actual yield data(partial /consolidated) received from the State Govt., the  same shall be brought into the notice of concerned State department within 7 days from date of receipt of yield data with specific observations/remarks under intimation to Govt. of India and anomaly, if any, may be resolved  in next 7 days by the  State Level Coordination Committee (SLCC)  headed by Additional Chief Secretary/Principal Secretary/Secretary of the concerned department. This committee shall be authorized to decide all such cases and the decision in such cases shall be final. The SLCC may refer the case to State Level Technical Advisory Committee (STAC) for dispute resolution (Constitution of STAC is defined in Para 19.5). In case the matter stands unresolved even after examination by STAC, it may be escalated to TAC along with all relevant documents including minutes of meetings/records of discussion and report of the STAC and SLCC. Reference to TAC can be made thereafter only in conditions specified in Para 19.7.1 However, data with anomalies which is not reported within 7 days will be treated as accepted to insurance company.'",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 384]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]

Evaluation

Metrics

Information Retrieval

Metric Value
cosine_accuracy@1 0.43
cosine_accuracy@5 0.87
cosine_accuracy@10 0.92
cosine_precision@1 0.43
cosine_precision@5 0.174
cosine_precision@10 0.092
cosine_recall@1 0.43
cosine_recall@5 0.87
cosine_recall@10 0.92
cosine_ndcg@5 0.6779
cosine_ndcg@10 0.6934
cosine_ndcg@100 0.7122
cosine_mrr@5 0.6127
cosine_mrr@10 0.6188
cosine_mrr@100 0.6234
cosine_map@100 0.6234
dot_accuracy@1 0.43
dot_accuracy@5 0.87
dot_accuracy@10 0.92
dot_precision@1 0.43
dot_precision@5 0.174
dot_precision@10 0.092
dot_recall@1 0.43
dot_recall@5 0.87
dot_recall@10 0.92
dot_ndcg@5 0.6779
dot_ndcg@10 0.6934
dot_ndcg@100 0.7122
dot_mrr@5 0.6127
dot_mrr@10 0.6188
dot_mrr@100 0.6234
dot_map@100 0.6234

Training Details

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • gradient_accumulation_steps: 4
  • learning_rate: 1e-05
  • weight_decay: 0.01
  • num_train_epochs: 1.0
  • warmup_ratio: 0.1
  • load_best_model_at_end: True

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 8
  • per_device_eval_batch_size: 8
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 4
  • eval_accumulation_steps: None
  • learning_rate: 1e-05
  • weight_decay: 0.01
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 1.0
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: True
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: proportional

Training Logs

Epoch Step Training Loss loss val_evaluator_cosine_map@100
0.531 15 0.511 0.1405 0.6234
0.9912 28 - 0.1405 0.6234
  • The bold row denotes the saved checkpoint.

Framework Versions

  • Python: 3.10.14
  • Sentence Transformers: 3.0.1
  • Transformers: 4.41.1
  • PyTorch: 2.3.0+cu121
  • Accelerate: 0.27.2
  • Datasets: 2.19.1
  • Tokenizers: 0.19.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

GISTEmbedLoss

@misc{solatorio2024gistembed,
    title={GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embedding Fine-tuning}, 
    author={Aivin V. Solatorio},
    year={2024},
    eprint={2402.16829},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}