ROHITH VENKATA REDDY
knight7561
AI & ML interests
Deep learning, Autonomous Driving
Recent Activity
updated
a collection
2 days ago
Post Training
commented on
a paper
2 days ago
Direct Preference Optimization: Your Language Model is Secretly a Reward
Model
commented on
a paper
12 days ago
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Organizations
Collections
2
Papers dump of LLM Reasoning domain
-
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Paper • 2407.14507 • Published • 46 -
Large Language Models are Zero-Shot Reasoners
Paper • 2205.11916 • Published • 1 -
Let's Verify Step by Step
Paper • 2305.20050 • Published • 10 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 11
models
5
knight7561/SmolLM2_python_coder-FT-ORPO
Text Generation
•
Updated
•
11
knight7561/SmolLM2-FT-DPO-python-code
Text Generation
•
Updated
•
10
knight7561/SmolLM2_python_coder
Text Generation
•
Updated
•
61
knight7561/SmolLM2-eli5_precomputed_top_slice
Text Generation
•
Updated
•
32
knight7561/SmolLM2-FT-MyDataset
Text Generation
•
Updated
•
14
datasets
None public yet