ROHITH VENKATA REDDY

knight7561

AI & ML interests

Deep learning, Autonomous Driving

Recent Activity

updated a collection 2 days ago

Post Training

commented on a paper 2 days ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

commented on a paper 12 days ago

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Organizations

Collections 2

spaces 1

Sleeping

First Agent Template

⚡

Fetch arXiv papers and get local timezone time

models 5

datasets

None public yet

Collections 2

Aligning Instruction Tuning with Pre-training

Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Internal Consistency and Self-Feedback in Large Language Models: A Survey

Large Language Models are Zero-Shot Reasoners

Let's Verify Step by Step

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

models 5

knight7561/SmolLM2_python_coder-FT-ORPO

knight7561/SmolLM2-FT-DPO-python-code

knight7561/SmolLM2_python_coder

knight7561/SmolLM2-eli5_precomputed_top_slice

knight7561/SmolLM2-FT-MyDataset
