BGEFM-ATM-Avg-v1

This is a sentence-transformers model finetuned from philschmid/bge-base-financial-matryoshka. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: philschmid/bge-base-financial-matryoshka
  • Maximum Sequence Length: 512 tokens
  • Output Dimensionality: 768 dimensions
  • Similarity Function: Cosine Similarity
  • Language: en

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel 
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
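
Because the final module is Normalize(), the embeddings this model produces are unit-length. The stated sequence and dimensionality limits can also be checked directly on the loaded model; a minimal sketch using only standard Sentence Transformers attributes:

from sentence_transformers import SentenceTransformer

# Sketch: confirm the limits stated in the Model Description above
model = SentenceTransformer("jdaviescmg/BGEFM-ATM-Avg-v1")
print(model.max_seq_length)                      # 512 tokens
print(model.get_sentence_embedding_dimension())  # 768 dimensions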

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("jdaviescmg/BGEFM-ATM-Avg-v1")
# Run inference
sentences = [
    'Hi',
    '☐ Item 1.01 Entry into a Material Definitive Agreement.\n\nOn\nAugust 21, 2024, Lexaria Bioscience Corp. (the “Company”) entered into a\nCapital on Demand™ Sales Agreement (the “Sales Agreement”) with JonesTrading\nInstitutional Services LLC (the “Agent”), pursuant to which the Company may\nissue and sell, from time to time, up to $20,000,000 in aggregate principal\namount of shares (the “Shares”) of the Company’s common stock, par value\n$0.001 per share, through or to the Agent, as the Company’s sales agent or\nprincipal.\n\nAny Shares to be offered and sold under the Sales Agreement will be\nissued and sold by methods deemed to be an “at-the-market offering” as defined\nin Rule 415(a)(4) promulgated under the Securities Act of 1933, as amended\n(the “Act”), or in negotiated transactions, if authorized by the Company.\n\nSubject to the terms of the Sales Agreement, the Agent will use reasonable\nefforts to sell the Shares from time to time, based upon the Company’s\ninstructions (including any price, time, or size limits or other customary\nparameters or conditions the Company may impose).\n\nThe Company cannot provide\nany assurances that it will issue any Shares pursuant to the Sales Agreement.The Company will pay the Agent a commission of 3.0% of the gross sales price\nof the Shares sold pursuant to the Sales Agreement, if any.\n\nThe Company has\nagreed to reimburse the Agent for certain specified expenses as provided in\nthe Sales Agreement and has also agreed to provide the Agent with customary\nindemnification and contribution rights in respect of certain liabilities,\nincluding liabilities under the Act.\n\nThe Sales Agreement also contains\ncustomary representations, warranties and covenants.The offering of the\nShares will terminate upon the earliest of (a) the issuance and sale of all of\nthe Shares by the Agent on the terms and subject to the conditions set forth\nin the Sales Agreement or (b) the termination of the Sales Agreement by either\nof the parties thereto.',
    'Note 9 – Employee Benefit Plans The Company maintains defined\ncontribution benefit plans under Section 401(k) of the Internal Revenue Code\ncovering substantially all qualified employees of the Company (the “401(k)\nPlan”).\n\nUnder the 401(k) Plan, the Company may make discretionary\ncontributions of up to 100 % of employee contributions.\n\nFor the six months\nended June 30, 2024 and 2023, the Company made contributions to the 401(k)\nPlan of $ 109,000 and $ 95,000 , respectively.Note 10 – Liquidity The Company\nfollows “ Presentation of Financial Statements—Going Concern (Subtopic\n205-40): Disclosure of Uncertainties about an Entity’s Ability to Continue as\na Going Concern ”.\n\nThe Company’s financial statements have been prepared\nassuming that it will continue as a going concern, which contemplates\ncontinuity of operations, realization of assets, and liquidation of\nliabilities in the normal course of business.\n\nAs reflected in the financial\nstatements, the Company has historically incurred a net loss and has an\naccumulated deficit of approximately $ 133,148,000 at June 30, 2024, and net\ncash used in operating activities of approximately $ 1,693,000 for the\nreporting period then ended.\n\nThe Company is implementing its business plan and\ngenerating revenue; however, the Company’s cash position and liquid crypto\nassets are sufficient to support its daily operations over the next twelve\nmonths.Our Form S-3 expired on August 14, 2024.\n\nThe Company filed a new Form\nS-3 on February 14, 2024.\n\nAs a result of SEC comments, the new Form S-3 has\nnot yet gone effective and therefore we may not sell shares under the ATM\nAgreement.Note 11 – Subsequent Events The Company evaluates events that have\noccurred after the balance sheet date but before the financial statements are\nissued.\n\nBased upon the evaluation, the Company did not identify any recognized\nor non-recognized subsequent events that would have required adjustment or\ndisclosure in the financial statements other than disclosed.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
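
Because the model was trained with MatryoshkaLoss over dimensions 768, 512, 256, 128, and 64 (see Training Details below), embeddings can also be truncated at load time via the library's truncate_dim argument. A minimal sketch:

from sentence_transformers import SentenceTransformer

# Load the same model, keeping only the first 256 Matryoshka dimensions
model_256 = SentenceTransformer("jdaviescmg/BGEFM-ATM-Avg-v1", truncate_dim=256)
embeddings = model_256.encode(["Hi", "At-the-Market Offering Sales Agreement"])
print(embeddings.shape)
# (2, 256)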

Evaluation

Metrics

Custom Triplet

  • Dataset: dim_768
  • Evaluated with main.CustomTripletEvaluator
Metric Value
cosine_accuracy 0.75
dot_accuracy 0.25
manhattan_accuracy 0.735
euclidean_accuracy 0.75
max_accuracy 0.75

Custom Triplet

  • Dataset: dim_512
  • Evaluated with main.CustomTripletEvaluator
Metric Value
cosine_accuracy 0.75
dot_accuracy 0.25
manhattan_accuracy 0.735
euclidean_accuracy 0.75
max_accuracy 0.75

Custom Triplet

  • Dataset: dim_256
  • Evaluated with main.CustomTripletEvaluator
Metric Value
cosine_accuracy 0.75
dot_accuracy 0.25
manhattan_accuracy 0.735
euclidean_accuracy 0.75
max_accuracy 0.75

Custom Triplet

  • Dataset: dim_128
  • Evaluated with main.CustomTripletEvaluator
Metric Value
cosine_accuracy 0.75
dot_accuracy 0.25
manhattan_accuracy 0.735
euclidean_accuracy 0.75
max_accuracy 0.75

Custom Triplet

  • Dataset: dim_64
  • Evaluated with main.CustomTripletEvaluator
Metric Value
cosine_accuracy 0.75
dot_accuracy 0.25
manhattan_accuracy 0.735
euclidean_accuracy 0.75
max_accuracy 0.75
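
The tables above were produced by a project-specific evaluator (main.CustomTripletEvaluator) whose code is not included in this card. A comparable check can be run with the library's built-in TripletEvaluator; the triplet below is an illustrative placeholder, not taken from the evaluation data:

from sentence_transformers import SentenceTransformer
from sentence_transformers.evaluation import TripletEvaluator

model = SentenceTransformer("jdaviescmg/BGEFM-ATM-Avg-v1")

# Placeholder triplet: anchor, a related (ATM offering) passage, and an unrelated disclosure
evaluator = TripletEvaluator(
    anchors=["Hi"],
    positives=["The Company entered into an at-the-market sales agreement with its sales agent."],
    negatives=["The Company maintains defined contribution benefit plans under Section 401(k)."],
    name="dim_768",
)
print(evaluator(model))  # e.g. {'dim_768_cosine_accuracy': ...}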

Training Details

Training Dataset

Unnamed Dataset

  • Size: 800 training samples
  • Columns: sentence1, sentence2, and label
  • Approximate statistics based on the first 1000 samples:
    • sentence1 (string): min 3 tokens, mean 3.0 tokens, max 3 tokens
    • sentence2 (string): min 35 tokens, mean 371.57 tokens, max 512 tokens
    • label (int): 0: ~50.00%, 1: ~50.00%
  • Samples:
    Sample 1
    sentence1: Hi
    sentence2: 8. COMMON STOCK [a] Authorized 150,000,000 authorized
    common shares, par value of $ 0.001 , and 5,000,000 preferred shares, par
    value of $ 0.001 .

    [b] Issued and outstanding shares At-the-Market Sales
    AgreementOn December 21, 2021, we entered into an At-the-Market Offering
    Sales Agreement, or ATM, with Virtu Americas, LLC, as sales agent.

    The ATM was
    terminated on February 29, 2024, and no further sales of our common stock will
    be made pursuant to the ATM.

    Since entry into the ATM, through the date of
    termination of the ATM, we offered and sold an aggregate of 200,000 shares of
    our common stock.

    These aggregate sales resulted in gross proceeds to us of
    approximately $ 1.5 million.

    During the three and six months ended June 30,
    2024, we did no t sell any shares of our common stock pursuant to the ATM.May
    2023 Registered Direct Offering In May 2023, we entered into a securities
    purchase agreement with certain purchasers, pursuant to which we sold
    3,000,000 shares of common stock at a price of $ 5.50 per share in a
    registered direct offering.

    The offering of the shares was made pursuant to
    our shelf registration statement on Form S-3 including the prospectus dated
    January 5, 2022 contained therein, and the prospectus supplement dated May 25,
    2023. We received approximately $ 15.3 million in net proceeds from the
    registered direct offering after deducting placement agent fees and offering
    expenses.February 2024 Registered Direct Offering and Concurrent Private
    PlacementIn February 2024, we entered into a securities purchase agreement
    with certain purchasers, pursuant to which we sold 13,086,151 shares of common
    stock at a price of $ 4.585 per share in a registered direct offering.

    The
    offering of the shares was made pursuant to our shelf registration statement
    on Form S-3, including the prospectus dated January 5, 2022 contained therein,
    and the prospectus supplement dated February 28, 2024.
    label: 1

    Sample 2
    sentence1: Hi
    sentence2: The foregoing description of the Note does not purport to be complete and is
    subject to, and is qualified in its entirety by reference to, the full text of
    the Note, which is attached as Exhibit 10.1 to this Current Report on Form
    8-K, and is incorporated herein by reference.Item 2.03.

    Creation of a Direct
    Financial Obligation or an Obligation under an Off-Balance Sheet Arrangement
    of a Registrant.

    The disclosure provided in Item 1.01 of this Current Report
    on Form 8-K is hereby incorporated by reference into this Item 2.03.Item
    8.01.Other Events.

    The Company is supplementing the Company’s risk factors in
    its Annual Report on Form 10-K filed with the SEC on March 29, 2024, and
    Quarterly Reports on Form 10-Q for the quarters ended March 31, 2024 and June
    30, 2024, filed with the SEC on May 10, 2024 and August 14, 2024,
    respectively, with the risk factor set forth below.Servicing our debt will
    require a significant amount of cash, and we may not have sufficient cash flow
    from our business to pay our debt.

    Our ability to make scheduled payments of
    the principal of, to pay interest on or to refinance our indebtedness depends
    on our future performance, which is subject to economic, financial,
    competitive and other factors beyond our control.

    We had, as of June 30, 2024,
    approximately (i) $16.1 million in working capital, (ii) $2.4 million in cash
    and cash equivalents, and (iii) $13.6 million of outstanding indebtedness, net
    of discounts.

    In addition, on August 15, 2024, we amended and restated the
    unsecured promissory note and guaranty previously issued to JXVII Trust that
    increased the principal amount from $7.6 million to $10.0 million.
    label: 0

    Sample 3
    sentence1: Hi
    sentence2: The Company
    incurred costs of approximately $0.9 million related to the execution of the
    Purchase Agreement.

    Of the total costs incurred, approximately $0.6 million
    was paid in Common Stock to Lincoln Park as a commitment fee and $ 0.03
    million to reimburse Lincoln Park for expenses.

    These transaction costs were
    included in other income / (expenses), net in the consolidated statement of
    operations.

    Approximately $ 0.2 million was incurred for legal fees, which
    were included in administrative and selling expenses on the consolidated
    statement of operations.During the year ended December 31, 2023, the Company
    issued and sold an aggregate of 293,509 shares pursuant to the Purchase
    Agreement and received net proceeds of $ 5.5 million.During the year ended
    December 31, 2023, the Company incurred approximately $ 0.3 million of
    expenses, related to the discount on the issuance of common stock to Lincoln
    Park, which is included in other income / (expenses), net in the consolidated
    statement of operations.

    As the Company’s common stock price is below $15.00
    per share, the Company is unable to utilize the facility.At the Market
    Offering Agreement On June 2, 2023, the Company entered into an At The Market
    Offering Agreement (the “ATM Agreement”) with H.C. Wainwright & Co., LLC, as
    sales agent (the “Agent”), to create an at-the-market equity program under
    which it may sell up to $50 million of shares of the Company’s common stock
    (the “Shares”) from time to time through the Agent (the “ATM Offering”).

    Under
    the ATM Agreement, the Agent will be entitled to a commission at a fixed rate
    of 3.0 % of the gross proceeds from each sale of Shares under the ATM
    Agreement.
    label: 1
  • Loss: MatryoshkaLoss with these parameters:
    {
        "loss": "CustomContrastiveLoss",
        "matryoshka_dims": [
            768,
            512,
            256,
            128,
            64
        ],
        "matryoshka_weights": [
            1,
            1,
            1,
            1,
            1
        ],
        "n_dims_per_step": -1
    }
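
The inner loss named above, CustomContrastiveLoss, is project-specific and not published with this card. As a hedged sketch of the overall setup, the library's stock ContrastiveLoss can stand in for it inside MatryoshkaLoss with the same dimensions and weights; the toy rows below only mirror the sentence1 / sentence2 / label columns described above:

from datasets import Dataset
from sentence_transformers import SentenceTransformer
from sentence_transformers.losses import ContrastiveLoss, MatryoshkaLoss

model = SentenceTransformer("philschmid/bge-base-financial-matryoshka")

# Toy rows mirroring the (sentence1, sentence2, label) columns; label 1 = related pair
train_dataset = Dataset.from_dict({
    "sentence1": ["Hi", "Hi"],
    "sentence2": [
        "At-the-Market Offering Sales Agreement with the sales agent ...",
        "Note 9 - Employee Benefit Plans ...",
    ],
    "label": [1, 0],
})

# Stand-in for CustomContrastiveLoss, wrapped so every Matryoshka dimension is trained
inner_loss = ContrastiveLoss(model)
loss = MatryoshkaLoss(
    model,
    inner_loss,
    matryoshka_dims=[768, 512, 256, 128, 64],
    matryoshka_weights=[1, 1, 1, 1, 1],
)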
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: epoch
  • per_device_train_batch_size: 32
  • per_device_eval_batch_size: 16
  • gradient_accumulation_steps: 16
  • learning_rate: 2e-05
  • num_train_epochs: 10
  • warmup_ratio: 0.1
  • use_mps_device: True
  • optim: adamw_hf
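
Expressed as training arguments, these non-default values correspond roughly to the sketch below (the output directory is a placeholder):

from sentence_transformers import SentenceTransformerTrainingArguments

args = SentenceTransformerTrainingArguments(
    output_dir="output/BGEFM-ATM-Avg-v1",  # placeholder path
    eval_strategy="epoch",
    per_device_train_batch_size=32,
    per_device_eval_batch_size=16,
    gradient_accumulation_steps=16,
    learning_rate=2e-5,
    num_train_epochs=10,
    warmup_ratio=0.1,
    use_mps_device=True,
    optim="adamw_hf",
)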

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: epoch
  • prediction_loss_only: True
  • per_device_train_batch_size: 32
  • per_device_eval_batch_size: 16
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 16
  • eval_accumulation_steps: None
  • learning_rate: 2e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 10
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: True
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • use_ipex: False
  • bf16: False
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_hf
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: False
  • hub_always_push: False
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • dispatch_batches: None
  • split_batches: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: False
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: proportional

Training Logs

Epoch Step Training Loss dim_128_cosine_accuracy dim_256_cosine_accuracy dim_512_cosine_accuracy dim_64_cosine_accuracy dim_768_cosine_accuracy
0.64 1 - 0.555 0.555 0.555 0.555 0.555
1.92 3 - 0.605 0.605 0.605 0.605 0.605
2.56 4 - 0.645 0.645 0.645 0.645 0.645
3.84 6 - 0.68 0.68 0.68 0.68 0.68
4.48 7 - 0.685 0.685 0.685 0.685 0.685
5.76 9 - 0.685 0.685 0.685 0.685 0.685
6.4 10 0.3122 0.675 0.675 0.675 0.675 0.675
0.64 1 - 0.675 0.675 0.675 0.675 0.675
1.92 3 - 0.72 0.72 0.72 0.72 0.72
2.56 4 - 0.72 0.72 0.72 0.72 0.72
3.84 6 - 0.73 0.73 0.73 0.73 0.73
4.48 7 - 0.73 0.73 0.73 0.73 0.73
5.76 9 - 0.725 0.725 0.725 0.725 0.725
6.4 10 0.1092 0.735 0.735 0.735 0.735 0.735
0.64 1 - 0.735 0.735 0.735 0.735 0.735
1.92 3 - 0.73 0.73 0.73 0.73 0.73
2.56 4 - 0.74 0.74 0.74 0.74 0.74
3.84 6 - 0.745 0.745 0.745 0.745 0.745
4.48 7 - 0.75 0.75 0.75 0.75 0.75
5.76 9 - 0.75 0.75 0.75 0.75 0.75
6.4 10 0.0811 0.75 0.75 0.75 0.75 0.75

Framework Versions

  • Python: 3.12.5
  • Sentence Transformers: 3.0.1
  • Transformers: 4.41.2
  • PyTorch: 2.4.1
  • Accelerate: 0.34.2
  • Datasets: 2.19.1
  • Tokenizers: 0.19.1
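
To reproduce this environment, the versions above can be pinned directly (newer releases will usually work as well but are untested here):

pip install sentence-transformers==3.0.1 transformers==4.41.2 torch==2.4.1 accelerate==0.34.2 datasets==2.19.1 tokenizers==0.19.1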

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MatryoshkaLoss

@misc{kusupati2024matryoshka,
    title={Matryoshka Representation Learning}, 
    author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
    year={2024},
    eprint={2205.13147},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}