DreamMr commited on
Commit
83d6f81
·
verified ·
1 Parent(s): dacd914

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +63 -3
README.md CHANGED
@@ -1,3 +1,63 @@
1
- ---
2
- license: unknown
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: unknown
3
+ base_model:
4
+ - BleachNick/MMICL-Instructblip-T5-xxl
5
+ ---
6
+
7
+ # Model Card
8
+
9
+ [**🌐Homepage**](https://github.com/DreamMr/WisdoM) | [**📖 Paper**](https://dl.acm.org/doi/abs/10.1145/3664647.3681403)
10
+
11
+ We trained using MMICL on the MSED training set for research in the field of multimodal sentiment analysis.
12
+
13
+ ## Training Details
14
+
15
+ Our training code is sourced from: [here](https://github.com/HaozheZhao/MIC/blob/master/run_script/flickr/deep_speed_instructblip_t5xxl.sh).
16
+
17
+ - data format
18
+
19
+ ```
20
+ {
21
+ "text": "Sentence: \"An overweight Hispanic woman and a young mixed race Hispanic and Caucasian man exercising together outdoors in an urban setting, running or jogging. They are smiling, looking at each other as they exercise.\". Use the image 0: <image0>图图图图图图图图图图图图图图图图图图图图图图图图图图图图图图图图 as a visual aids to help you answer the question. Question: according to the image 0 and sentence, what is the sentiment polarity? Choose from the following options:\nA).positive\nB).neutral\nC).negative\n\nAnswer: ",
22
+ "image": "xxx/1.jpg",
23
+ "aspect": "",
24
+ "label": "neutral"
25
+ }
26
+ ```
27
+
28
+
29
+
30
+ - Hyperparameters
31
+
32
+ | | batch size | learning rate | epoch |
33
+ | ---------- | ---------- | ------------- | ----- |
34
+ | MMICL-MSED | 4 | 1e-4 | 3 |
35
+
36
+
37
+
38
+ ## Evaluation on MSED dataset
39
+
40
+ | | Precision | Recall | Mac-F1 |
41
+ | ---------------------- | --------- | -------- | -------- |
42
+ | DCNN | 59.3 | 53.0 | 51.2 |
43
+ | BiLSTM | 78.4 | 78.8 | 78.6 |
44
+ | DCNN+AlexNet | 71.0 | 70.1 | 70.3 |
45
+ | BiLSTM+AlexNet | 78.7 | 79.2 | 78.9 |
46
+ | BERT+AlexNet | 83.2 | 83.1 | 83.2 |
47
+ | Multimodal Transformer | 83.6 | 83.5 | 83.5 |
48
+ | **MMICL** | **89.4** | **86.4** | **87.1** |
49
+
50
+ The evaluation code 👉 [here](https://github.com/DreamMr/WisdoM)
51
+
52
+ ## Citation
53
+
54
+ ```
55
+ @inproceedings{wang2024wisdom,
56
+ title={Wisdom: Improving multimodal sentiment analysis by fusing contextual world knowledge},
57
+ author={Wang, Wenbin and Ding, Liang and Shen, Li and Luo, Yong and Hu, Han and Tao, Dacheng},
58
+ booktitle={Proceedings of the 32nd ACM International Conference on Multimedia},
59
+ pages={2282--2291},
60
+ year={2024}
61
+ }
62
+ ```
63
+