Model Description

This model predicts the star rating (1 - 5) of a Yelp review based on its text content. It was trained using GPT-2 and BERT, with BERT achieving the best performance at 75% validation accuracy. The model addresses class imbalance using weighted loss and optimizes hyperparameters to enhance generalization.

Training Details

  • Dataset: Yelp Reviews dataset (100,000 samples used)

  • Preprocessing:

    • GPT-2 Tokenizer with Byte-Pair Encoding (BPE) for rare words
    • Truncation (128 tokens) and padding for uniform input size
  • Models Trained:

    • GPT-2: Fine-tuned with a custom classification head, achieving 67% validation accuracy

    • BERT: Fine-tuned with bidirectional attention, achieving 75% validation accuracy

  • Loss Function: Weighted Cross-Entropy Loss to counteract class imbalance

Limitations

  • Performance may degrade on highly informal or extremely short reviews

  • Class imbalance still affects predictions for underrepresented ratings

  • Model was trained on English-language reviews only

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.