Class Robot:

</ OpenAI + Robot = Figure 01>

With OpenAI, Figure 01 can now have full conversations with people

-OpenAI models provide high-level visual and language intelligence
-Figure neural networks deliver fast, low-level, dexterous robot actions

</ Tesla + Robot = Optimus Gen 2>

A general-purpose, bipedal, humanoid robot capable of performing tasks that are unsafe, repetitive or boring.


</ Boston Dynamics + Robot = Atlas>

A fully electric Atlas robot designed for real-world applications. The next generation of the Atlas program builds on decades of research to deliver the most capable, useful mobile robots solving the toughest challenges in industry today.

</ Hanson AI + Robot = Sophia>

Sophia is simultaneously a human-crafted science fiction character depicting the future of AI and robotics, and a platform for advanced robotics and AI research.

Class LLM:

</ OpenAI + AI = GPT-4o>

GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. 

It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversation.

GPT-4o is notably better at vision and audio understanding than existing models.

</ Anthropic + AI = Claude 3>

The Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks, includes three powerful AI models.

  • Haiku (Light & Fast)
    The fastest model that can execute lightweight actions, with industry-leading speed.
  • Sonnet (Hard-working)
    The best combination of performance and speed for efficient, high-throughput tasks.
  • Opus (Powerful)
    The most intelligent model, which can handle complex analysis, longer tasks with multiple steps, and higher-order math and coding tasks.

</ Google + AI = Gemini 1.5 Pro>

Gemini 1.5 Pro is a foundation model that performs well at a variety of multimodal tasks such as visual understanding, classification, summarization, and creating content from image, audio and video. It’s adept at processing visual and text inputs such as photographs, documents, infographics, and screenshots.


Gemini 1.5 Pro has a context window of up to 1 million tokens, making it more efficient at exploring, analyzing, and understanding large data sets and documents of up to 1,500 pages.

</ Meta + AI = Llama 3>

Llama 3 is an openly accessible model that excels at language nuances, contextual understanding, and complex tasks like translation and dialogue generation. With enhanced scalability and performance, Llama 3 can handle multi-step tasks effortlessly. Additionally, it drastically elevates capabilities like reasoning, code generation, and instruction following. Build the future of AI with Llama 3.

