transformers_issues_topics

This is a BERTopic model. BERTopic is a flexible and modular topic modeling framework that allows for the generation of easily interpretable topics from large datasets.

Usage

To use this model, please install BERTopic:

pip install -U bertopic

You can use the model as follows:

from bertopic import BERTopic
topic_model = BERTopic.load("aouichaty/transformers_issues_topics")

topic_model.get_topic_info()

Topic overview

  • Number of topics: 30
  • Number of training documents: 9000
Click here for an overview of all topics.
Topic ID Topic Keywords Topic Frequency Label
-1 bert - tensorflow - pytorch - pretrained - trainer 12 -1_bert_tensorflow_pytorch_pretrained
0 tokenizer - tokenizers - tokenization - tokenize - token 2216 0_tokenizer_tokenizers_tokenization_tokenize
1 gpt2 - gpt2doubleheadsmodel - gpt - gpt2lmheadmodel - distilgpt2 1767 1_gpt2_gpt2doubleheadsmodel_gpt_gpt2lmheadmodel
2 trainer - trainertrain - trainers - training - tf 739 2_trainer_trainertrain_trainers_training
3 modelcard - modelcards - card - model - models 692 3_modelcard_modelcards_card_model
4 gluepy - glue - glueconvertexamplestofeatures - attributeerror - importerror 533 4_gluepy_glue_glueconvertexamplestofeatures_attributeerror
5 summarization - summaries - summary - text - nlp 427 5_summarization_summaries_summary_text
6 s2s - exampless2s - seq2seq - seq2seqtrainer - runseq2seq 345 6_s2s_exampless2s_seq2seq_seq2seqtrainer
7 readmemd - readmetxt - readme - modelcard - file 317 7_readmemd_readmetxt_readme_modelcard
8 typos - typo - fix - correction - fixed 251 8_typos_typo_fix_correction
9 transformerscli - transformers - transformer - transformerxl - importerror 238 9_transformerscli_transformers_transformer_transformerxl
10 deprecate - deprecation - deprecated - warnings - warning 190 10_deprecate_deprecation_deprecated_warnings
11 ner - pipeline - pipelines - nerpipeline - fillmaskpipeline 155 11_ner_pipeline_pipelines_nerpipeline
12 flaxbertformaskedlm - flax - flaxelectraformaskedlm - flaxjax - distilbertmodel 149 12_flaxbertformaskedlm_flax_flaxelectraformaskedlm_flaxjax
13 longformer - longformers - longform - longformerformultiplechoice - longformertokenizerfast 147 13_longformer_longformers_longform_longformerformultiplechoice
14 tests - testing - test - testgeneratefp16 - slow 138 14_tests_testing_test_testgeneratefp16
15 onnx - onnxonnxruntime - onnxexport - 04onnxexport - 04onnxexportipynb 125 15_onnx_onnxonnxruntime_onnxexport_04onnxexport
16 generationbeamsearchpy - generatebeamsearch - generatebeamsearchoutputs - beamsearch - nonbeamsearch 95 16_generationbeamsearchpy_generatebeamsearch_generatebeamsearchoutputs_beamsearch
17 benchmark - benchmarks - accuracy - precision - comparison 74 17_benchmark_benchmarks_accuracy_precision
18 wav2vec2 - wav2vec - wav2vec20 - wav2vec2forctc - wav2vec2xlrswav2vec2 69 18_wav2vec2_wav2vec_wav2vec20_wav2vec2forctc
19 huggingfacemaster - huggingfacetokenizers297 - huggingface - huggingfaces - huggingfacetransformers 52 19_huggingfacemaster_huggingfacetokenizers297_huggingface_huggingfaces
20 prophetnet - prophetnetmodel - prophetnetforcausallm - prophetnetngramattention - prophetnetforconditionalgeneration 52 20_prophetnet_prophetnetmodel_prophetnetforcausallm_prophetnetngramattention
21 notebook - notebooks - blenderbot3b - community - blenderbot 43 21_notebook_notebooks_blenderbot3b_community
22 cachedir - cache - cachedpath - caching - cached 43 22_cachedir_cache_cachedpath_caching
23 closed - add - bort - added - deleted 32 23_closed_add_bort_added
24 electra - electrapretrainedmodel - electraformaskedlm - electraformultiplechoice - electrafortokenclassification 28 24_electra_electrapretrainedmodel_electraformaskedlm_electraformultiplechoice
25 layoutlm - layout - layoutlmtokenizer - layoutlmbaseuncased - tf 25 25_layoutlm_layout_layoutlmtokenizer_layoutlmbaseuncased
26 isort - blackisortflake8 - github - repo - version 16 26_isort_blackisortflake8_github_repo
27 pplm - pr - deprecated - variable - ppl 15 27_pplm_pr_deprecated_variable
28 colab - cola - crashes - crash - tcmalloc 15 28_colab_cola_crashes_crash

Training hyperparameters

  • calculate_probabilities: False
  • language: english
  • low_memory: False
  • min_topic_size: 10
  • n_gram_range: (1, 1)
  • nr_topics: 30
  • seed_topic_list: None
  • top_n_words: 10
  • verbose: True

Framework versions

  • Numpy: 1.23.5
  • HDBSCAN: 0.8.33
  • UMAP: 0.5.3
  • Pandas: 1.5.3
  • Scikit-Learn: 1.2.2
  • Sentence-transformers: 2.2.2
  • Transformers: 4.33.1
  • Numba: 0.56.4
  • Plotly: 5.15.0
  • Python: 3.10.12
Downloads last month
10
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.