Georgia Tech (Georgia Institute of Technology)

university

Verified

https://gatech.edu

GeorgiaTech

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

MichaelR207 authored a paper 9 days ago

Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs

MichaelR207 authored a paper 9 days ago

Mind the Gap! Static and Interactive Evaluations of Large Audio Models

azheng83 updated a model 17 days ago

GeorgiaTech/sonic

View all activity

GeorgiaTech's activity

LLM606

authored a paper about 15 hours ago

COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs

Paper • 2502.17410 • Published 16 days ago

azheng83

updated a model 17 days ago

GeorgiaTech/sonic

Updated 17 days ago

azheng83

published a model 17 days ago

GeorgiaTech/sonic

Updated 17 days ago

Alanturner2

updated a Space about 2 months ago

Arxiv Summarizer

summarize arixv papers and chat with your data

Alanturner2

published a Space about 2 months ago

Arxiv Summarizer

summarize arixv papers and chat with your data

tilmto

authored a paper 4 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 42

clin354

authored a paper 6 months ago

MoDeGPT: Modular Decomposition for Large Language Model Compression

Paper • 2408.09632 • Published Aug 19, 2024

ZhangShenao

updated 7 models 10 months ago

GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3

Text Generation • Updated May 13, 2024 • 6

GeorgiaTech/0.0005_zephyr_withdpo_5551_4iters_bs256_newtrl_iter_3

Text Generation • Updated May 12, 2024 • 9

GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_2

Text Generation • Updated May 12, 2024 • 94

GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_1

Text Generation • Updated May 12, 2024 • 91

GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_3

Text Generation • Updated May 12, 2024 • 6

GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_2

Text Generation • Updated May 12, 2024 • 14

GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_1

Text Generation • Updated May 12, 2024 • 95

hbeadles

updated 2 models 11 months ago

GeorgiaTech/bert-generative-pubmedqa

Text2Text Generation • Updated Apr 28, 2024 • 13

GeorgiaTech/scibert-generative-pubmedqa

Text2Text Generation • Updated Apr 26, 2024 • 10 • 1

haotiansun014

authored a paper 11 months ago

ToolQA: A Dataset for LLM Question Answering with External Tools

Paper • 2306.13304 • Published Jun 23, 2023

tarunchy

updated a model about 1 year ago

GeorgiaTech/t5-small-finetuned

Text2Text Generation • Updated Jan 8, 2024 • 21

zefang-liu

authored a paper about 1 year ago

SecQA: A Concise Question-Answering Dataset for Evaluating Large Language Models in Computer Security

Paper • 2312.15838 • Published Dec 26, 2023

huckiyang

authored a paper over 1 year ago

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models

Paper • 2309.15701 • Published Sep 27, 2023 • 2