Questions to LLMs panel @ RL+LLM workshop @ AAAI 2024 - brainstorming and voting

Hallucination is good to some degree to help LLMs extrapolate and generalize. How RL can help with mitigating false hallucinations while encouraging correct hallucinations?

1

Vote

Mohamad

What are challenges and opportunities for NLP researchers with limited resources, e.g., most university profs and PhD students, in the LLMs era? What may be great research topics for them?

1

Vote

Maryam Hashemi

How can reinforcement learning techniques be leveraged to enhance interpretability and explainability in language models, ensuring transparency and trustworthiness in their decision-making processes?

by Maryam Hashemi

0

Vote

What are the main obstacles to improve DPO results vs RLHF? Is overfitting the only issue with DPO (which can be solved by a huge labeled dataset), or does learning the reward and policy simultaneously create other issues?

by Salarrahili

0

Vote

The RLHF framework has challenges with costly data annotators and sparse rewards. How can we leverage offline data to create a denser reward function?

by Elahe Aghapour

0

Vote

How can integrating RL facilitate the development of personalized LLMs for individual user preferences? Furthermore, what implications does it raise regarding privacy concerns?

by Mojtaba

0

Vote

What are the main challenges for LLMs to have solid planning capabilities? How can reinforcement learning help?

0

Vote

How can LLMs weigh the value of seeking more information from the user vs. responding the query with existing information? Thoughts on fusing exploration techniques from reinforcement learning?

0

Vote

How can LLMs personalize themselves through the user feedback to provide a more refined experience for the user?

0

Vote

How do you like approaches like self-generation, self-evaluation, and self-improvements (based on a fixed LLM)? Is it information perpetual motion machine?

0

Vote

Will GPT lead to AGI? Any suggestions for potential alternatives?

0

Vote

Login

Comments

About tricider

The Company

Legal stuff

Get in touch

About Brainstorming

About Crowdsourcing