Welcome to q-transformer Discussions! #1
Replies: 3 comments 18 replies
-
Can anyone working in robotics comment on how significant this work is? I'll prioritize accordingly, thanks!
-
I don't work in robotics, but I'll definitely try this out with Mathy when you think it's ready. I dropped the trained (A3C/AlphaZero) mathy agent examples a while ago in favor of swarm planning, but I'm interested in bringing them back to check out transformer-based agents that can work with large discrete action spaces.
-
The trouble I have in contributing here is that I'm not sure how Mathy should integrate with LMs yet. Currently, it interfaces with models similar to any other Gym environment: it outputs observations in a floating-point format and only uses text as the input for the math parser that builds an expression tree. For Mathy w/language models, I think perhaps interfacing with DSPy could be interesting, using LMs to handle the conversion of natural language inputs into ASCII-math expressions that Mathy can read and conversion of output states into natural language explanations based on the applied rule and affected expressions. One other idea with Mathy is that you could use it to generate large datasets of step-by-step solutions to problems and use that dataset to fine-tune an LLM. Perhaps it overlaps with RLHF more in that kind of use case? I'm pretty ignorant here since I haven't spent much time with LLMs outside of daily ChatGPT use.
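To make the dataset idea a bit more concrete, here is a toy sketch of what generating step-by-step solutions as fine-tuning records could look like. It does not use Mathy's actual API: `solve_linear_steps`, `make_records`, and `to_jsonl` are hypothetical helpers, the "rules" are hard-coded for equations of the form `ax + b = c`, and the `prompt`/`completion` record layout is just an assumed convention, not a requirement of any particular fine-tuning tool.

```python
import json
import random
from fractions import Fraction


def solve_linear_steps(a: int, b: int, c: int) -> list[str]:
    """Return the worked solution of a*x + b = c as a list of text steps.

    Hypothetical stand-in for a rule-based solver; assumes a != 0.
    """
    rhs = c - b
    x = Fraction(rhs, a)  # exact result, printed as "4" or "-3/2"
    return [
        f"{a}x + {b} = {c}",
        f"{a}x = {rhs}  (subtract {b} from both sides)",
        f"x = {x}  (divide both sides by {a})",
    ]


def make_records(n: int, seed: int = 0) -> list[dict]:
    """Generate n prompt/completion pairs from random small problems."""
    rng = random.Random(seed)  # seeded so the dataset is reproducible
    records = []
    for _ in range(n):
        a = rng.randint(1, 9)
        b = rng.randint(1, 9)
        c = rng.randint(1, 20)
        steps = solve_linear_steps(a, b, c)
        records.append(
            {
                "prompt": f"Solve for x: {steps[0]}",
                "completion": "\n".join(steps[1:]),
            }
        )
    return records


def to_jsonl(records: list[dict]) -> str:
    """Serialize records as JSON Lines, one record per line."""
    return "\n".join(json.dumps(r) for r in records)


if __name__ == "__main__":
    print(to_jsonl(make_records(3)))
```

In a real setup the hard-coded steps would presumably be replaced by whatever sequence of rule applications the environment records while solving a problem, with the applied rule and the affected expression written into each step.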
I feel this has been the case for a long time. I don't have much of an audience, but I've tried reaching out to RL people/companies for years now, and it seems like I'm the only one interested in math tutoring with RL 🤷 (or my limited ruleset makes the project appear too basic, or I didn't build a flashy web demo, or something else.)
I saw it come by, but I still need to read it. I've been working with VAEs and Diffusion models lately, and my time for reading research papers is limited. Was it all hype or did it inspire hope? It's on my list for sure 📖
-
👋 Welcome!
We’re using Discussions as a place to connect with other members of our community. We hope that you:
build together 💪.
To get started, comment below with an introduction of yourself and tell us about what you do with this community.