티스토리

MLZoo

검색하기

블로그 홈

MLZoo

jhn9803.tistory.com/m

ML/DL

구독자: 0

방명록 방문하기

주요 글 목록

[Paper Review] Training Verifiers to Solve Math Word Problems (GSM8K) paper: Cobbe, Karl, et al. "Training verifiers to solve math word problems." arXiv preprint arXiv:2110.14168 (2021).link: https://arxiv.org/abs/2110.14168 Training Verifiers to Solve Math Word ProblemsState-of-the-art language models can match human performance on many tasks, but they still struggle to robustly perform multi-step mathematical reasoning. To diagnose the failures of current models.. 공감수 1 댓글수 0 2025. 1. 8.
[Paper Review] Mistral 7B paper: Jiang, Albert Q., et al. "Mistral 7B." arXiv preprint arXiv:2310.06825 (2023)link: https://arxiv.org/abs/2310.06825 Mistral 7BWe introduce Mistral 7B v0.1, a 7-billion-parameter language model engineered for superior performance and efficiency. Mistral 7B outperforms Llama 2 13B across all evaluated benchmarks, and Llama 1 34B in reasoning, mathematics, and code generation. Our marxiv.org.. 공감수 1 댓글수 0 2024. 12. 22.
[Paper Review] Reflexion: Language Agents with Verbal Reinforcement Learning Paper: Shinn, Noah, et al. "Reflexion: Language agents with verbal reinforcement learning." Advances in Neural Information Processing Systems 36 (2024).link: https://proceedings.neurips.cc/paper_files/paper/2023/hash/1b44b878bb782e6954cd888628510e90-Abstract-Conference.html Reflexion: language agents with verbal reinforcement learningRequests for name changes in the electronic proceedings will b.. 공감수 0 댓글수 0 2024. 12. 7.

문의안내

티스토리
로그인
고객센터

티스토리는 카카오에서 사랑을 담아 만듭니다.