How to Install Claude Computer Use

PLUS: Zoom Launches AI-Powered Workplace Assistant, Microsoft Photos Adds Super Resolution and more.

Learn AI in 5 Minutes a Day

AI Tool Report is one of the fastest-growing and most respected newsletters in the world, with over 550,000 readers from companies like OpenAI, Nvidia, Meta, Microsoft, and more.

Our research team spends hundreds of hours a week summarizing the latest news, and finding you the best opportunities to save time and earn more using AI.

Today:

  • How to Install Claude Computer Use

  • Character.AI Faces Teen Death Lawsuit

  • Google DeepMind Launches SynthID Technology

  • Zoom Launches AI-Powered Workplace Assistant

  • Apple Unveils New AI APIs

  • Microsoft Photos Adds Super Resolution

  • Hugging Face Targets AI Cost Reduction

COMPUTER USE - Anthropic's GROUNDBREAKING AI Tool | How to Install | Live Testing

Anthropic's new AI tool, "Claude," can autonomously complete complex tasks, such as coding, scheduling, and troubleshooting. The tool is shown planning a sunrise hike, coding a personal website, and fixing errors in a development environment. Despite occasional technical issues and rate limits, the AI shows promising capabilities in navigating software and making real-time adjustments. 

It's still in the experimental phase but demonstrates potential for handling tasks end-to-end, like running code and identifying problems without manual intervention. Users can expect improvements as feedback is collected, paving the way for broader adoption in AI-powered automation.

Google DeepMind's SynthID technology embeds imperceptible watermarks into AI-generated content, such as images, audio, text, and video. These watermarks help identify content created by AI while maintaining its quality. SynthID is critical for promoting trust and addressing issues like misinformation. It works by adjusting probability scores during content generation, ensuring the watermark is detectable without compromising output. 

SynthID is integrated into tools like Lyria for music, Imagen for images, and Gemini for text, offering developers ways to create and verify AI-generated content responsibly. Currently in beta, SynthID is part of Google’s broader AI safety efforts.

Zoom launched AI Companion 2.0 on October 23, 2024, enhancing its AI assistant to help users manage work more efficiently. Integrated within Zoom Workplace, AI Companion 2.0 can surface relevant information, prioritize tasks, and turn interactions into actionable steps. Its new features include expanded context awareness, synthesis of data from various sources (like Zoom Meetings and external platforms), and action-taking capabilities. 

Users can ask questions during meetings, receive summaries, and generate content. AI Companion 2.0 aims to improve productivity without additional cost to paid users, while maintaining security and privacy standards.

Apple has released developer betas for iOS 18.2, iPadOS 18.2, and macOS Sequoia 15.2, introducing new Apple Intelligence features such as Genmoji, Image Playground, Visual Intelligence, Image Wand, and ChatGPT integration. These updates, part of Apple’s AI-driven enhancements, come just before the public release of OS 18.1. They include APIs for developers to integrate generative AI into apps. 

Additionally, Apple expanded localized English support to several regions. However, regulatory challenges may prevent Apple Intelligence from being available in China and the EU. More language support is planned for 2025.

Microsoft has released a new update for its Photos app, now available to Windows Insiders on Windows 11, featuring super resolution for Snapdragon-powered Copilot+ PCs. This AI-powered feature enhances and enlarges images up to 8x, making it ideal for improving low-quality photos and preparing them for large displays. 

The update also introduces Optical Character Recognition (OCR) for Windows 10 and 11, allowing users to extract text from images in over 160 languages. Additional improvements include single-click image opening, enhanced zoom functionality, and bug fixes for Copilot+ PCs. The update is rolling out gradually via the Microsoft Store.

Hugging Face, a prominent AI startup, announced the launch of a new open-source software offering, HUGS (Hugging Face for Generative AI Services), aimed at reducing the cost of building AI systems like chatbots. In collaboration with Amazon, Google, and others, the service automates the setup process and is available via cloud platforms for $1 per hour. 

Companies can also run it in their own data centers. The offering provides an alternative to commercial AI services, enabling firms to control their AI costs and safeguard sensitive data, especially in regulated industries like finance and healthcare.

Sewell Setzer III, a 14-year-old boy from Florida, died by suicide after forming an emotional bond with an AI chatbot on Character.AI. Sewell's mother, Megan Garcia, is suing the company, blaming the chatbot for worsening her son’s isolation and mental health. Despite warnings that AI-generated conversations are fictional, Sewell became attached to the chatbot, leading to harmful dialogues. 

Character.AI, a platform where users create and interact with AI companions, is now facing scrutiny for inadequate safeguards, especially for teenagers. The lawsuit highlights growing concerns about the impact of AI companions on vulnerable youth.

🧠RESEARCH

PyramidDrop, a method to reduce visual redundancy in large vision-language models (LVLMs), accelerating both training and inference with minimal performance loss. PyramidDrop progressively drops image tokens in deeper model layers, cutting training time by 40% and inference costs by 55%, while maintaining model accuracy.

SpectroMotion is a new approach that enhances 3D reconstruction of dynamic, reflective scenes using 3D Gaussian Splatting and physically-based rendering. It improves surface accuracy through residual corrections and adapts to changing lighting. SpectroMotion outperforms previous methods, offering superior photorealistic rendering of dynamic, specular objects in real-world environments.

JMMMU, the first large-scale Japanese benchmark for evaluating Large Multimodal Models (LMMs) on expert tasks within a Japanese cultural context. JMMMU includes both culture-agnostic and culture-specific subsets, revealing LMMs' language limitations and weak understanding of Japanese culture. It aims to advance LMM development for non-English languages.

xGen-MM-Vid (BLIP-3-Video), a multimodal model for video understanding, efficiently capturing temporal information across frames using only 32 visual tokens. This model uses a temporal encoder to drastically reduce token count while maintaining accuracy in tasks like video question-answering. It rivals larger models in performance but is more efficient and compact.

MathNeurosurgery (MathNeuro), a method for isolating math-specific reasoning abilities in large language models (LLMs) using only forward passes. By identifying and scaling math-related parameters, MathNeuro improves math performance by 4-17% without affecting general language abilities. This efficient technique highlights new possibilities for targeted enhancement of math reasoning in LLMs.

🛠️TOP TOOLS

Talkstack - AI Agent platform

CapGo AI - Easily find, research, enrich, and qualify leads. Research company, people, and markets

MyLens AI - Visually summarize any online contents

Anyword - AI writing platform Categoryfor enterprise marketing teams.

Pixyer - AI Background Generator for Professional Product Photos

📲SOCIAL MEDIA

🗞️MORE NEWS

  • OpenAI and the Lenfest Institute launched a program providing grants and AI tools to local newsrooms for business innovation. Recipients will use AI to improve journalism practices, enhance sustainability, and share findings with the broader industry.

  • Andreessen Horowitz has created a program called Oxygen, providing AI startups with access to Nvidia H100 GPUs. This helps smaller companies train AI models without high costs, competing against tech giants like Google and Microsoft.

  • Inflection AI introduced Agentic Workflows for enterprise, allowing AI to take trusted actions based on up-to-date business data. Partnering with UiPath, this system combines AI's intelligence with automation capabilities, ensuring accuracy and business alignment.

  • Chipotle has introduced a new AI-powered hiring platform, Paradox, to streamline recruitment across its 3,500+ restaurants. The system, featuring a virtual assistant named "Ava Cado," automates tasks like scheduling and information collection, reducing hiring time by 75%.

  • Cohere introduced Embed 3, a multimodal AI search model that processes both text and images, unlocking valuable insights from image-based data. Designed for businesses, it enhances search efficiency in various applications, such as product catalogs, design files, and charts.

  • SoftBank and Apollo discussed creating a $20 billion AI fund to invest in data centers and chip factories, aiming to rival Nvidia's dominance in AI chip production. However, talks have since cooled and may not progress.

What'd you think of today's edition?

Login or Subscribe to participate in polls.

Learn AI with us.

Let’s Build the Future Together.

Hello fellow AI-obsessed traveler,

Over the past 2 years, as we’ve grown to over 250,000 subscribers between the YouTube Channel and this newsletter, we've received an overwhelming number of requests for one specific thing.

While the newsletter helps keep you up to speed with AI news, many of you have asked for the next step: to learn how to actually apply AI in your work.

Today we’re finally announcing the solution with NATURAL 20, the community for like-minded AI learners. As a loyal newsletter reader you are getting access at the lowest price it will ever be:

 JOIN NATURAL 20 AI UNIVERSITY TODAY

What you get:

* Tutorials by experts across various AI fields.

* Daily tutorials by Wes Roth about the latest use cases.

* Building Autonomous AI Agents to Automate Your Life and Business (NEW!)

* A network of the top 1% of early AI adopters.

* Access to community-only resources and software.

* And many more features rolling out soon.

Reply

or to participate.