Hello AI Friends,
This week we explored the fascinating world of AI agents with the viral Manus AI framework that’s been making waves across social media.
The discussion around agent frameworks and whether they represent the future of human-AI interaction was particularly thought-provoking.
We also had a deep dive into the technical challenges facing our machine learning engineers working on real-world industrial applications – from waffle production optimization to the broader implications of digital twins in manufacturing.
The evolution of search with AI models and the potential impact on SEO sparked interesting debate, while our exploration of writing tools and MCP protocols highlighted how rapidly the creative landscape is changing.
See you next week,
Harry Verity
Bali AI Meetup host
Co-Founder, AI To The World
Attendees:
- Harry Verity (Host) – Tech journalist, AI consultant, and founder of AI to the World
- Jack – Working in sales, exploring AI tools for sales processes
- Raya – Programmer transitioning into app development
- Brian – Working in blockchain and NFTs
- Sean – Runs a news publishing business with 57 AI-powered sites
- Alina – Founder of a startup helping people achieve roles with AI-generated feedback
- Julian – Background in data analytics and project management
- Mark – Director of data science from Germany
- Toby – Developing a learning app with AI integration
- Gushan – Working on automated investment systems
- Alan – Machine learning engineer specializing in industrial applications
- Dev Chandra (online) – AI automation agency owner, currently in Japan
- Moritz/Flo (online) – AI engineer for technology news publisher
- Nick (online) – Working in AI and analytics at University of Sydney
- Pascal (online) – From Switzerland, interested in electronics and robotics
- Several other participants both in-person and online
Key Topics Discussed:
1. Manus AI Agent Framework Goes Viral
Manus AI, a Chinese agent framework, has gone viral with demonstrations of controlling 50 social media accounts simultaneously, creating a wave of excitement and skepticism in the AI community.
- Multi-agent system built on Claude 3.5 Sonnet (testing 3.7)
- Uses fine-tuned Qwen models (Alibaba’s competitive LLM)
- Invitation-only access, creating significant FOMO in the community
- Claimed to outperform OpenAI’s deep-research feature on GAIA benchmarks
Group Opinions:
- Skepticism about whether the viral video demonstrated real capabilities or was a staged simulation
- Discussion about how agent frameworks would handle CAPTCHAs and other anti-bot measures
- Debate about whether Manus represents a true breakthrough or mostly marketing hype
- Comparisons with OpenAI’s Swarm/Operator which has received less attention recently
- Questions about whether this signals China overtaking Western AI companies in agent technology
2. The Future of Search and SEO
Google’s experimental AI mode with Gemini 2.0 is pushing towards a closed ecosystem similar to OpenAI’s approach, potentially reshaping how people find information online.
Group Opinions:
- Many attendees reported rarely using traditional Google search anymore, preferring to query AI models directly
- Sean shared how his publishing business has adapted by focusing on direct newsletter engagement rather than SEO
- Discussion about the challenges of optimizing prompts for different models and the need to re-engineer prompts when switching models
- Debate about whether traditional SEO will remain relevant or if AI search will fundamentally change how we discover information
3. Mistral AI’s OCR Advancements
Mistral AI has launched a new OCR (Optical Character Recognition) feature claiming 94.89% accuracy compared to Google Document AI’s 83.40% and Azure OCR’s 89.5%.
Group Opinions:
- Discussion about whether 94-95% accuracy is sufficient for critical document processing
- Analysis of multilingual capabilities and special use cases (historical documents, handwritten text)
- Debate about business viability and whether the improvement justifies rescanning existing archives
- Exploration of ensemble approaches that could combine multiple OCR systems to improve accuracy
Technical Deep Dives:
1. Industrial AI Applications
Alan from AIrecruit shared his work on a fascinating real-world industrial application:
- Using IoT sensors and AI to optimize a food production line
- how AI can be used to reduce waste and lower energy consumption in food manufacturing facilities
- Implementing digital twins of manufacturing processes to enable real-time adjustments
- Combining neural networks with symbolic AI for more advanced reasoning capabilities
- Discussion of ROI calculations for industrial AI implementations
2. AI Writing Tools and MCP Integration
Harry demonstrated Butadocs, a writing platform he uses for novel writing:
- Described how Claude integration helps manage characters and plot across a multi-book series
- Discussed limitations of context windows when working on long-form creative content
- Explored the potential of agent frameworks and Model Context Protocol (MCP)
- Community debate about whether custom tools built with APIs are better than subscription services
- Predictions that future apps may exist exclusively through MCP rather than traditional interfaces
3. CAPTCHA and Authentication Challenges
A fascinating discussion about how agent frameworks will navigate security measures:
- Insights from an attendee who works at a CAPTCHA company
- Exploration of the “CAPTCHA solving as a service” black market
- Discussion of potential future approaches like LLM.txt standards for agent-friendly sites
- Debate about how platforms might balance automation accessibility with spam prevention
- Questions about identity validation and KYC approaches for AI agent actions
4. Community Initiative
- Sean proposed a new format where one member presents a specific AI challenge each week for collaborative problem-solving
- Plans to create a more focused announcement-only WhatsApp group for meeting updates
- Discussion about using AI to better moderate the community WhatsApp group
5. Emerging Tools Worth Exploring
- Kling – A website builder that integrates AI to rapidly build and deploy web applications
- Butadocs – Writing platform with AI integration useful for long-form content creation
- GPT-4.5 – Now available to ChatGPT Plus users but extremely expensive for API calls ($75 per million tokens)
- Open Router – A tool for accessing multiple AI models without managing multiple subscriptions
- Lovable.dev – AI-powered platform for building and scaling web applications