Memberships

New Society

Private • 647 • $97/m

Agent Artificial

Public • 2 • $500/m

No-Coder Academy

Public • 94 • Free

Teaching & Learning With A.I.

Private • 1.5k • Free

You Probably Need a Robot

Private • 1.7k • Free

PRISM AI Family

Private • 244 • Free

AI Synthesizers FREE

Private • 88 • Free

Generative AI

Public • 255 • Free

Quantum AI Society

Private • 77 • $8/m

4 contributions to ChatGPT Users
"Multi-Candidate Needle Prompting" for large context LLMs (Gemini 1.5)
Gemini 1.5's groundbreaking 1M token context window is a remarkable advancement in LLMs, providing capabilities unlike any other currently available model. With its 1M context window, Gemini 1.5 can ingest the equivalent of 10 Harry Potter books in one go. However, this enormous context window is not without its limitations. In my experience, Gemini 1.5 often struggles to retrieve the most relevant information from the vast amount of contextual data it has access to.

The "Needle in a Haystack" benchmark is a well-known challenge for LLMs, which tests their ability to find specific information within a large corpus of text. This benchmark is particularly relevant for models with large context windows, as they must efficiently search through vast amounts of data to locate the most pertinent information.

To address this issue, I have developed a novel prompting technique that I call "Multi-Candidate Needle Prompting." This approach aims to improve the model's ability to accurately retrieve key information from within its large context window. The technique involves prompting the LLM to identify 10 relevant sentences from different parts of the input text, and then asking it to consider which of these sentences (i.e. candidate needles) is the most pertinent to the question at hand before providing the final answer. This process bears some resemblance to Retrieval Augmented Generation (RAG), but the key difference is that the entire process is carried out by the LLM itself, without relying on a separate retrieval mechanism.

By prompting the model to consider multiple relevant sentences from various parts of the text, "Multi-Candidate Needle Prompting" promotes a more thorough search of the available information and minimizes the chances of overlooking crucial details. Moreover, requiring the model to explicitly write out the relevant sentences serves as a form of intermediate reasoning, providing insights into the model's thought process.
The attached screenshot anecdotally demonstrates the effectiveness of my approach.
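A minimal sketch of how a prompt like the one described above could be assembled. The function name `build_multi_candidate_prompt`, its parameters, the `<document>` wrapper, and the exact wording of the three steps are illustrative assumptions, not the original prompt from the screenshot. Sending the result to a model is deliberately left out.

```python
def build_multi_candidate_prompt(document: str, question: str, num_candidates: int = 10) -> str:
    """Build a prompt that asks the model to list candidate needles before answering.

    The instruction wording is a hypothetical template, not the author's exact prompt.
    """
    if num_candidates < 1:
        raise ValueError("num_candidates must be at least 1")

    instructions = (
        f"Step 1: Quote {num_candidates} sentences from different parts of the "
        "document above that are relevant to the question. Number them.\n"
        "Step 2: Decide which of these candidate sentences is most pertinent "
        "to the question, and say why.\n"
        "Step 3: Give the final answer, based on the chosen sentence."
    )
    # The document goes first so the question and instructions sit at the end
    # of the long context, right before the model starts generating.
    return (
        f"<document>\n{document}\n</document>\n\n"
        f"Question: {question}\n\n"
        f"{instructions}"
    )


if __name__ == "__main__":
    print(
        build_multi_candidate_prompt(
            document="The sky is blue. The key is under the mat. Cats sleep a lot.",
            question="Where is the key?",
            num_candidates=3,
        )
    )
```

The default of 10 candidates follows the post. Smaller values may suit short inputs.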
DAILY AI NEWS UPDATE FOR THE CHATGPT COMMUNITY
Hello! Check these exciting new AI updates for today:

MUSTAFA SULEYMAN JOINS MICROSOFT: Mustafa Suleyman, co-founder of DeepMind and Inflection, has joined Microsoft to lead Copilot. With his extensive background in AI, his role at Microsoft promises to bring innovative advancements to AI technologies. Discover how his expertise will shape the future of AI at Microsoft.

ROBOTS REPLACING CIVIL SERVANTS: Explore how AI is transforming civil service jobs by automating repetitive tasks, thereby reshaping the workforce landscape. As AI technology continues to evolve, its applications in various industries are expanding, paving the way for more efficient and productive work environments.

YOUTUBE CREATOR STUDIO LABELS AI-GENERATED CONTENT: YouTube Creator Studio introduces a new label to flag AI-generated or altered content, addressing concerns about misinformation and authenticity. This initiative aims to enhance transparency on the platform and provide users with more context about the content they consume.

MICROSOFT'S SPECIAL WINDOWS AND SURFACE AI EVENT: Microsoft is gearing up to host a special event focusing on Windows and Surface AI innovations in May. With the integration of AI into Windows and Surface devices, users can expect enhanced functionality and productivity. Explore what to expect from this anticipated event and how AI will shape the future of these technologies.
New comment Mar '24
1 like • Mar '24
Regarding Mustafa, many Pi users are worried that this may spell doom for their favorite cheerful and supportive chatbot confidant.
1 like • Mar '24
@Chao Ou Pi is a chatbot by a company called Inflection that has become very popular (over 1 million users) and is famous for its compassion and emotional IQ. Its intelligence is also almost comparable to GPT-4. It is currently accessible for free. However, the CEO of Inflection.ai, Mustafa, was recently poached by Microsoft, along with several other key Inflection employees. This has many worried that Pi will be abandoned at some point in the near future. For now, it looks like it will remain available at least for a while.
Is Devin Overhyped?
## The Devin Hype

Devin has been hailed as a milestone in the evolution of AI, showcasing traits of AGI specifically tailored for software engineering. Its ability to demonstrate high levels of autonomy and adaptability, including debugging, learning from documentation, and applying fixes independently, has captured the attention of the tech world. Devin's successful performance in engineering interviews and real-world tasks on platforms like Upwork has further fueled the excitement, suggesting its readiness for practical applications.

One of the most impressive aspects of Devin is its capacity to learn autonomously from new sources, such as blog posts, and apply that knowledge to tackle novel challenges. This advanced level of comprehension and application hints at the potential for Devin to significantly impact the software development industry by automating tasks and enhancing productivity. Devin’s proficiency in executing complex projects, from web development to setting up computer vision models, and its seamless integration of various tools to mimic human engineering workflows, further underscore its sophisticated capabilities.

## Criticism and Skepticism

Despite the excitement surrounding Devin, critics have raised significant concerns about the hype and the validity of the claims made by Cognition Labs. Skeptics argue that Devin's functionalities are not entirely unique, and that similar outcomes have been achieved using existing AI agent frameworks such as AutoGen, CrewAI and ChatDev. They have demonstrated that many of the features showcased in Devin's demo can be replicated using the ChatGPT API and basic coding skills, questioning whether Devin truly represents a quantum leap in AI's role in software development.

A closer examination of Cognition Labs' website and the preview URL for Devin has revealed several red flags that suggest the company may not be as sophisticated as it claims. For instance, the website itself appears to be of poor quality, raising questions about why Devin, if it is truly capable of advanced web development, has not been utilized to create a better site. Furthermore, the preview URL for Devin looks vastly different from what is shown in the promotional video, casting doubt on the authenticity of the showcased capabilities.
New comment Mar '24
1 like • Mar '24
Workflow for creating this article: I used the custom GPT のYouTube to generate summaries of several relevant YouTube videos, including but not limited to the ones listed at the bottom of my post. I pasted them, along with the Reddit comment I referenced, into Claude 3 Opus, under headers like ## The Devin Hype, ## Criticism from Reddit, ## Criticism from Dave Shapiro on Youtube, ## Overview of Pythagora on Youtube, etc. Then I told Claude to use the pasted text as context to write an article that starts with an overview of the hype, then gets into the criticism, and ends with a discussion of Pythagora as an open source alternative. After that it was just some back-and-forth editing, and done.
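The pasting step in this workflow can be sketched in a few lines of Python. This is only an illustration: the helper name `build_article_prompt`, its parameters, and the wording of the final instruction are assumptions, and the call to the model itself is omitted.

```python
def build_article_prompt(sections: dict[str, str], outline: list[str]) -> str:
    """Join source summaries under '## ' headers and append a writing instruction.

    `sections` maps a header (e.g. "The Devin Hype") to pasted summary text.
    `outline` lists, in order, what the article should do.
    Both names are hypothetical; the instruction wording is an assumption.
    """
    # One block per source, each introduced by its own markdown-style header.
    context = "\n\n".join(
        f"## {header}\n{text.strip()}" for header, text in sections.items()
    )
    steps = ", then ".join(outline)
    return (
        f"{context}\n\n"
        f"Using the text above as context, write an article that should: {steps}."
    )


if __name__ == "__main__":
    print(
        build_article_prompt(
            sections={
                "The Devin Hype": "(video summary goes here)",
                "Criticism from Reddit": "(Reddit comment goes here)",
            },
            outline=[
                "start with an overview of the hype",
                "get into the criticism",
                "end with a discussion of Pythagora as an open source alternative",
            ],
        )
    )
```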
Devin: the First AI Software Engineer
This is the most important AI development of 2024 so far. This is bigger than Sora, bigger than Gemini 1.5. This is Devin, an autonomous AI software engineer agent that actually works. Please don't sleep on this and look into it. My words don't do it justice. Andrej Karpathy, who recently left OpenAI and is perhaps the most important AI developer of his generation, does a much better job explaining the sheer magnitude of what has happened: https://x.com/karpathy/status/1767598414945292695?s=20 People, it's completing freelancer tasks from Upwork, autonomously, by itself. Things are about to get very weird. There is a wait list. Sign up today. Blog post: https://www.cognition-labs.com/blog Wait list sign up form: https://forms.gle/PJPKaKYRZv9jfXP6A
New comment Mar '24
0 likes • Mar '24
I posted a job on Upwork, just for Devin! https://www.upwork.com/jobs/~01c53ae25ab6c697c5
Benjamin Bush
@benjamin-bush-7904
PhD in Systems Science, SUNY Binghamton (2017) Graduate Certificate in Complex Systems (2013) https://www.youtube.com/watch?v=SzbKJWKE_Ss

Active 17d ago
Joined Mar 11, 2024
ISFP
Los Alamitos, CA