In the realm of artificial intelligence, OpenAI is embarking on a journey that transcends the boundaries of traditional computing, redefining the very essence of what a computer can be. This vision extends far beyond the scope of Large Language Models (LLMs) and touches upon several foundational pillars, each poised to revolutionise the way we interact with technology.
A glimpse into the future:
OpenAI's ambitious vision begins with a deep understanding of user preferences. The ultimate goal is for AI to know exactly what a user wants, down to the most specific details. This is the moment when technology stops being intimidating and becomes genuinely helpful. It's a vision that traces its roots back to October 2011, when Apple unveiled Siri. Although it has been unattainable for years, we are finally closing in on that decade-old vision.
The power of real-time data:
While a significant portion of an AI's utility is derived from its foundational training and refinement through human feedback, the true potential lies in its ability to tap into real-time, external data sources. Collaborations with platforms like Zapier are the initial steps, but the real change lies in integration with third-party applications and data pipelines. The scope extends beyond minor tasks like "chatting with a PDF."
Unleashing AI's computing potential:
To break free from the limits of the context window, AI providers must let AI execute tasks within sandboxed runtime environments, similar to a Python or Node/Deno runtime. This approach allows AI to process vast amounts of data, much as a traditional computer does, instead of squeezing everything into a prompt. While these environments are presently used mostly by data analysts and other professionals, they are poised to evolve into long-running data processing hubs, which will transform data analysis and cross-file inference.
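A minimal sketch of that idea, in Python: the model writes a snippet, the host runs it in a separate interpreter process, and only the short result returns to the conversation. The name run_generated_code and the timeout value are assumptions made for illustration. A child process is not a real sandbox; a production system would add proper isolation such as containers, resource limits, and restricted network access.

```python
import subprocess
import sys


def run_generated_code(code: str, timeout_s: float = 5.0) -> str:
    """Run a snippet in a separate Python process and return its stdout.

    Hypothetical helper: it only separates processes, it does not sandbox.
    """
    result = subprocess.run(
        [sys.executable, "-I", "-c", code],  # -I: isolated mode, ignores user site and env vars
        capture_output=True,
        text=True,
        timeout=timeout_s,  # raises subprocess.TimeoutExpired if the snippet hangs
    )
    if result.returncode != 0:
        raise RuntimeError(result.stderr.strip())
    return result.stdout.strip()


# The heavy lifting happens outside the model's context; only the summary comes back.
snippet = "print(sum(range(1_000_000)))"
summary = run_generated_code(snippet)
print(summary)
```

The point of the design is that the data itself never has to fit in the prompt: the environment does the computation and the model only sees the compact answer.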
The art of agent task and flow planning:
Effective planning hinges on an accurate understanding of user intent. Unraveling intent has been a longstanding challenge: for years it was pursued with classical NLP techniques (which now seem incredibly dated!), and LLMs have finally cracked the code.
Once intent is accurately grasped, the orchestration of an agent planner begins. This process calls for seamless integration with user preferences, third-party data sources, and a profound understanding of computational capabilities.
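To make the orchestration step concrete, here is a small sketch, assuming the intent has already been extracted. The Intent and Step classes, the plan function, and the TOOLS recipe table are all hypothetical names invented for this example, not part of any OpenAI API. The planner merges stored preferences with the details of the request and emits an ordered list of tool calls.

```python
from dataclasses import dataclass, field


@dataclass
class Intent:
    goal: str                                   # what the user wants done
    slots: dict = field(default_factory=dict)   # details stated in the request


@dataclass
class Step:
    tool: str
    args: dict


# Hypothetical recipe table: goal -> ordered (tool name, argument names it accepts).
TOOLS = {
    "book_flight": [
        ("search_flights", ["origin", "destination", "currency"]),
        ("reserve_seat", ["seat"]),
    ],
}


def plan(intent: Intent, preferences: dict, tools: dict) -> list:
    """Turn an intent into ordered steps, filling gaps from user preferences."""
    recipe = tools.get(intent.goal)
    if recipe is None:
        raise ValueError(f"no known plan for goal: {intent.goal}")
    # Explicit details in the request override stored preferences.
    known = {**preferences, **intent.slots}
    return [
        Step(tool=name, args={key: known[key] for key in accepted if key in known})
        for name, accepted in recipe
    ]


preferences = {"currency": "EUR", "seat": "aisle"}
intent = Intent("book_flight", {"origin": "AMS", "destination": "JFK", "seat": "window"})
steps = plan(intent, preferences, TOOLS)
for step in steps:
    print(step.tool, step.args)
```

A real planner would let the model generate the recipe instead of looking it up, but the merge of intent, preferences, and available tools would look much the same.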
An ecosystem of expertise:
OpenAI's current focus on ChatGPT is merely the tip of the iceberg. As AI continues to evolve, a diverse array of specialised assistants is set to emerge. Builders will soon have the power to combine multiple tools into complex workflows, while AI itself will learn from the pioneers, heralding a new era of creative integration and innovation.
Enriching memory and experience:
AI embeddings and vector databases offer a foundation, but they lack essential elements such as context switching, conversational centroids, summarisation, and enrichment. The future involves embedding history and persistence, unlocking the potential for long-term memory enriched with pointers to critical subjects, emotions, tone, and more.
Core memory is only the starting point; the ultimate goal is to capture the intricate structure of information that our minds conjure when reminiscing about past experiences.
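As a rough illustration of memory that carries more than a bare embedding, the sketch below stores each memory with a topic and a tone alongside its vector, then recalls by similarity within a topic. The Memory class, the recall function, and the three-dimensional vectors are assumptions for the example; a real system would obtain vectors from an embedding model and store them in a vector database.

```python
import math
from dataclasses import dataclass


@dataclass
class Memory:
    text: str
    vector: list   # embedding; hand-written toy values here
    topic: str     # pointer to the subject of the memory
    tone: str      # emotional colour recorded with it


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def recall(memories, query_vector, topic=None, k=1):
    """Return the k most similar memories, optionally restricted to one topic."""
    candidates = [m for m in memories if topic is None or m.topic == topic]
    ranked = sorted(candidates, key=lambda m: cosine(m.vector, query_vector), reverse=True)
    return ranked[:k]


memories = [
    Memory("Prefers aisle seats on long flights", [0.9, 0.1, 0.0], "travel", "neutral"),
    Memory("Was frustrated by a late refund", [0.1, 0.9, 0.0], "billing", "frustrated"),
    Memory("Enjoyed the trip to New York", [0.8, 0.2, 0.1], "travel", "happy"),
]

best = recall(memories, [1.0, 0.0, 0.0], topic="travel", k=1)
print(best[0].text, "| tone:", best[0].tone)
```

The metadata is what lets an assistant switch context deliberately (filter by topic) and respond with the right tone, which a plain nearest-neighbour lookup cannot do on its own.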
Redefining time-bound tasks:
The term "agent" may carry varied interpretations, but its essence lies in tasks that can be scheduled and autonomously completed, regardless of the timeframe. Tasks such as "Let me know when flights from Amsterdam to New York are less than €500" necessitate intricate coordination across API providers and virtual environments in the cloud.
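The flight example can be sketched as a simple watch loop. Everything here is hypothetical: watch_price is an invented name, and fetch_price stands in for whatever airline or aggregator API the agent would actually call. The sleep function is passed in so the loop can be exercised without waiting.

```python
import time
from typing import Callable, Optional


def watch_price(
    fetch_price: Callable[[], float],
    threshold: float,
    max_checks: int,
    interval_s: float = 3600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[float]:
    """Poll a price source until it drops below the threshold.

    Returns the first price under the threshold, or None if the
    check budget runs out first.
    """
    for attempt in range(max_checks):
        price = fetch_price()
        if price < threshold:
            return price
        if attempt < max_checks - 1:
            sleep(interval_s)  # wait before the next check
    return None


# Simulated fares in euros, standing in for a real flight API.
fares = iter([620.0, 540.0, 480.0])
hit = watch_price(lambda: next(fares), threshold=500.0, max_checks=5, sleep=lambda s: None)
print(hit)
```

In practice the loop would run as a scheduled job in the cloud rather than a blocking function, and the return value would trigger a notification to the user.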
The future of user interfaces:
While text-based chat remains a cornerstone of human-AI interaction, it does not represent the ultimate evolution of user interfaces. Elements such as buttons, date pickers, and images in applications simplify and clarify interactions. AI is poised to become an adept co-pilot, customising its approach to cater to the unique needs of each user. The future of UI is inherently dynamic, adapting to individual optimisation requirements.
AI as a tool composer:
AI is ushering in a future where users can construct their own workflows and combine APIs, rather than waiting for startups to provide front-end solutions. As the dependency on apps and their front-ends shrinks, AI can compose a diverse arsenal of tools and APIs on the user's behalf, possibly coupled with a usage charge akin to a gas fee or tax.
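One way to picture tool composition with a usage charge is a pipeline in which every tool carries a price. This is purely a sketch: compose, the tool list, and the fee amounts (in cents) are assumptions for illustration, and how such fees would really be charged is an open question.

```python
from typing import Any, Callable, List, Tuple

# Each tool is a (function, fee in cents per call) pair; the fees are made up.
Tool = Tuple[Callable[[Any], Any], int]


def compose(tools: List[Tool]) -> Callable[[Any], Tuple[Any, int]]:
    """Chain tools into one workflow that also totals the fees incurred."""
    def workflow(value: Any) -> Tuple[Any, int]:
        total_fee = 0
        for func, fee in tools:
            value = func(value)   # output of one tool feeds the next
            total_fee += fee
        return value, total_fee
    return workflow


clean_route = compose([
    (str.strip, 0),                   # free local step
    (str.upper, 1),                   # stand-in for a paid API call
    (lambda s: s.split(","), 2),      # stand-in for another paid API call
])

result, fee_cents = clean_route("  ams,jfk ")
print(result, fee_cents)
```

The user describes the outcome, the AI picks and orders the tools, and the bill is the sum of what each tool charges.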
A symphony of assistant interactions:
In the foreseeable future, an ecosystem of specialised assistants is set to develop, each contributing to and collaborating with other assistants to achieve objectives. In this collaborative environment, assistants must adapt to diverse communication modalities, spanning text, APIs, file systems, and other channels embraced by agents, startups, and humans, as integration evolves to a deeper level.
The future of plugin and app stores:
Specialised assistants come to life through the combination of tools, APIs, prompts, data, preferences, and more. The current OpenAI plugin store is merely the beginning; expect more refinement and expansion as these plugins evolve into essential components of the ecosystem.
This exposition is a glimpse into the major challenges that are still ahead of us. Behind the scenes, a set of problems awaits, encompassing internet search and data scraping, community involvement, dynamic API generation, and integration via various devices such as glasses and earbuds. The realm of AI remains far from saturated, offering boundless prospects for innovation and creativity as we continue to iterate and expand our horizons.
We are at the beginning of a remarkable journey. The future of AI promises to be a thrilling one, brimming with opportunities for those who embrace the ever-evolving landscape. OpenAI is leading the charge for now, but we're just getting started.