Fake It Till You Make It: HyDE, Step-Back Prompts, Hybrid Search, and More

4 min readApr 17, 2025

As you may know already,

“There are a lot of issues with vanilla RAG.”

Are you tired of AI giving you half-baked answers? Frustrated with language models that seem to know everything but your specific data?

You’re not alone. While Retrieval Augmented Generation (RAG) promised to bridge the gap between AI and your proprietary information, many are finding that vanilla RAG just isn’t cutting it. The good news is that there are many advanced techniques that can help us get better results!

But before we expose those limitations and show you how to overcome them, let’s make sure we’re all on the same page.

What is RAG, and Why Should You Care?

RAG, or Retrieval Augmented Generation, is a game-changing technique that connects external data sources to Large Language Models (LLMs). It allows you to ask questions about your own proprietary data and get informed responses. In its simplest form, vanilla RAG works by chunking your documents, embedding them, storing them in a vector database, and then retrieving relevant chunks when a user asks a question.

But remember: vanilla RAG is just the tip of the iceberg. Researchers have introduced different types of RAG improvement techniques that here we are going to cover a few of them.

Vanilla RAG is Holding You Back: Here’s Why You Need to Level Up

While vanilla RAG is a great starting point, it falls short in several key areas. For instance, “When your question is let’s say so-called summarization type questions… or when there is a comparison you want to just compare multiple things against each other… vanilla RAG fails to answer them.

This limitation is just one of many reasons why you need to explore advanced RAG techniques.

Create Fake Questions to Get Real Answers: The Magic of Hypothetical Prompts

One interesting approach is the use of hypothetical questions. Instead of directly passing a user’s query to the vector database, this method generates several relevant questions first.

For instance, “We can generate some hypothetical questions that users may ask in the future and later on when user ask a question we can go and find the most similar hypothetical questions that we have then find the relevant chunks.

This approach allows for a more nuanced search, potentially uncovering information that a direct query might miss.

Fake It Till You Make It: Hypothetical Document Embeddings

Taking the concept of “fake it till you make it” to a whole new level, hypothetical document embeddings involve generating fake responses to a user’s query without any context. In our video today, Mehdi explained:

“We can generate some fake documents… Now that we have this fake document then we can go and pass this document and find relevant chunks from the vector database that are similar or related to this fake document.”

This technique can help bridge the gap between the user’s query language and the language used in your documents, potentially leading to more relevant retrievals.

Curious to learn more?

In the following video, we also covered -

  • step-back prompting,
  • sub-queries,
  • automatic chunk merging,
  • hierarchical indexing and
  • hybrid retrieval and reranking…

Watch the full episode here! 🤓:

👇

Before we go on…a quick announcement -

🚀 Join Our New YouTube Membership Community!

For many of you following us on YouTube. thank you so much for your support! 🦄

In addition to our regular updates, I’m excited to announce the launch of our membership community! Whether you’re looking to master Retrieval-Augmented Generation (RAG), AI Agents, or dive deep into advanced AI projects and tutorials through AI Unbound, there’s something for everyone passionate about AI.

By joining, you’ll gain exclusive content, stay ahead of the curve, and reduce AI FOMO while building real-world skills. Ready to take your AI journey to the next level?

👉 Join the Community Here

Let’s build, learn, and innovate together!

🛠️✨ Happy practicing and happy building! 🚀🌟

Thanks for reading our newsletter. You can follow us here: Angelina Linkedin or Twitter and Mehdi Linkedin or Twitter.

Source of image:
HQ

HyDE

🌈 Our RAG course: https://maven.com/angelina-yang/mastering-rag-systems-a-hands-on-guide-to-production-ready-ai

📚 Also if you’d like to learn more about RAG systems, check out our book on the RAG system: You can download for free on the course site:
https://maven.com/angelina-yang/mastering-rag-systems-a-hands-on-guide-to-production-ready-ai

🦄 Any specific contents you wish to learn from us? Sign up here: https://noteforms.com/forms/twosetai-youtube-content-sqezrz

🧰 Our video editing tool is this one!: https://get.descript.com/nf5cum9nj1m8

📽️ Our RAG videos: https://www.youtube.com/@TwoSetAI

--

--

No responses yet