Feedle - Ai - hiroppy's site

RSS Feeds:

Last updated: 2025/07/17 14:01

langchain-ollama==0.3.5

この記事は、langchain-ollamaのバージョン0.3.5のリリースに関する情報を提供しています。このリリースは2022年7月16日に行われ、主な変更点として、async OllamaEmbeddingsメソッドにおけるnum_gpuパラメータの不具合が修正されました。これにより、GPUの数を指定する際の問題が解決され、より効率的な処理が可能になります。 • langchain-ollamaのバージョン0.3.5がリリースされた • リリース日は2022年7月16日 • async OllamaEmbeddingsメソッドにおけるnum_gpuパラメータの不具合が修正された • 修正によりGPUの数を正しく指定できるようになった

langchain-ai/langchain 2025/07/16

release tool

Accenture scales video analysis with Amazon Nova and Amazon Bedrock Agents

This post was written with Ilan Geller, Kamal Mannar, Debasmita Ghosh, and Nakul Aggarwal of Accenture. Video highlights offer a powerful way to boost audience engagement and extend content value for content publishers. These short, high-impact clips capture key moments that drive viewer retention, amplify reach across social media, reinforce brand identity, and open new […]

AWS Machine Learning Blog 2025/07/16

tool

No Image

Voxtral

Mistral released their first audio-input models yesterday: Voxtral Small and Voxtral Mini. These state‑of‑the‑art speech understanding models are available in two sizes—a 24B variant for production-scale applications and a 3B …

Simon Willison's Blog 2025/07/16

api tool

No Image

common-pile/caselaw_access_project

Enormous openly licensed (I believe this is almost all public domain) training dataset of US legal cases: This dataset contains 6.7 million cases from the Caselaw Access Project and Court …

Simon Willison's Blog 2025/07/16

api cloud tool

Deploy conversational agents with Vonage and Amazon Nova Sonic

In this post, we explore how developers can integrate Amazon Nova Sonic with the Vonage communications service to build responsive, natural-sounding voice experiences in real time. By combining the Vonage Voice API with the low-latency and expressive speech capabilities of Amazon Nova Sonic, businesses can deploy AI voice agents that deliver more human-like interactions than traditional voice interfaces. These agents can be used as customer support, virtual assistants, and more.

AWS Machine Learning Blog 2025/07/16

api cloud tool

LangSmith and LangGraph Platform are now available in AWS Marketplace

LangSmith and LangGraph Platform (self-hosted deployments) are now available in AWS Marketplace.

LangChain Blog 2025/07/16

platform tool

2025-07-15

この記事は、mastra-aiのリリースノートに関するもので、2025年7月15日に行われた更新内容を詳述しています。主な変更点には、プレイグラウンドにおける作業メモリ機能の追加、推論を表示する機能の実装、エージェントネットワークのリクエストルーティングの修正、クライアントSDKの互換性向上、メモリ機能のベンチマーク準備、エラー処理の改善などが含まれています。また、メモリ設定の更新ロジックの改善や、パフォーマンス向上に関する実験的な機能も紹介されています。これらの変更は、ユーザーがエージェントとインタラクションする際の体験を向上させることを目的としています。 • 作業メモリ機能の追加により、ユーザーはエージェントとのインタラクション中に作業メモリを表示・編集できるようになった。 • 推論を表示する機能がプレイグラウンドインターフェースに追加された。 • エージェントネットワークのリクエストがvNextエージェントネットワークに正しくルーティングされるよう修正された。 • クライアントSDKでのcrypto.randomUUIDの使用が修正され、互換性の問題が解決された。 • メモリ機能のベンチマーク準備が行われ、メモリ機能の評価が可能になった。 • エラー処理が改善され、OpenAIRealtimeVoiceがOpenAIからのエラーを適切に処理できるようになった。 • 実験的なメモリ機能の改善により、パフォーマンスが20%向上した。

mastra-ai/mastra 2025/07/16

library release tool

More advanced AI capabilities are coming to Search

For Google AI Pro and AI Ultra subscribers, AI Mode in Search now features the ability to use Gemini 2.5 Pro and do deeper research for you.

Google AI Blog 2025/07/16

api tool

Open Deep Research

TL;DR Deep research has broken out as one of the most popular agent applications. OpenAI, Anthropic, Perplexity, and Google all have deep research products that produce comprehensive reports using various sources of context. There are also many open source implementations. We've built an open deep researcher that is simple

LangChain Blog 2025/07/16

api tool

Enabling customers to deliver production-ready AI agents at scale

Today, I’m excited to share how we’re bringing this vision to life with new capabilities that address the fundamental aspects of building and deploying agents at scale. These innovations will help you move beyond experiments to production-ready agent systems that can be trusted with your most critical business processes.

AWS Machine Learning Blog 2025/07/16

tool

How to build unified AI interfaces using the Vercel AI SDK

Learn how to use the Vercel AI SDK to build modern, multimodal frontend apps with streaming, function calling, image analysis, voice output, and generative UI.

logrocket-dev 2025/07/16

library tool

Here's how to make these in ChatGPT

1. Upload a picture of yourself/subject using the ✚ button in ChatGPT. 2. Ask chat for: ➡️ A black and white close-up portrait with visible water droplets and small bubbles on the face like the subject just emerged from water. The mood should feel intense and cinematic, with a dark, minimal background.

YouTube OpenAI 2025/07/16

0.49.0 - 2025-07-16

この記事は、OpenHandsのバージョン0.49.0のリリースノートを提供しています。このリリースでは、CLIとVSCodeの統合が追加され、OpenHands Cloudを通じてLLM用のプロバイダーが導入されました。また、新しいメモリUI機能が追加され、会話カードにブランチ名とGitプロバイダーが表示されるようになりました。CLIの初回実行時にエイリアスを設定する機能も追加され、ユーザーがコマンドを簡単に実行できるようになっています。さらに、いくつかのバグ修正やUIの改善が行われ、全体的な安定性が向上しました。 • CLIとVSCodeの統合が追加された • OpenHands Cloudを通じてLLM用のプロバイダーが導入された • 新しいメモリUI機能が追加された • 会話カードにブランチ名とGitプロバイダーが表示されるようになった • CLIの初回実行時にエイリアス設定機能が追加された • いくつかのバグ修正が行われ、UIの改善が実施された

All-Hands-AI/OpenHands 2025/07/16

release tool

ModernBERT Decoder (based on v4.53.2)

この記事では、Hugging FaceのTransformersライブラリに新たに追加されたModernBERT Decoderモデルについて説明しています。このモデルは、v4.53.2リリースに基づいており、自己回帰的なテキスト生成タスクに特化したデコーダーアーキテクチャを持っています。ModernBERT Decoderは、ロタリーポジショナルエンコーディングや、8192トークンまでのシーケンスをサポートするための現代的なアーキテクチャの改善を取り入れています。インストールは、指定されたコマンドを使用して行うことができ、今後のマイナーリリースv4.54.0に含まれる予定です。使用例として、テキスト生成やテキスト分類のためのパイプラインの利用方法が示されています。 • 新しいモデルModernBERT DecoderがTransformersに追加された • ModernBERT Decoderは自己回帰的なテキスト生成タスクに特化している • ロタリーポジショナルエンコーディングを使用し、8192トークンまでのシーケンスをサポート • インストールは特定のコマンドを使用して行う • 今後のリリースv4.54.0に含まれる予定 • テキスト生成やテキスト分類の使用例が提供されている

huggingface/transformers 2025/07/16

library release

checkpointpostgres==2.0.23

この記事は、GitHub上のlangchain-aiリポジトリにおけるcheckpointpostgresのバージョン2.0.23のリリースについて説明しています。このリリースでは、checkpoint_blobsテーブルへの書き込みを削減するパフォーマンス改善が行われました。また、依存関係のアップグレードが行われ、いくつかのドキュメントのコメントにおける誤字も修正されています。これにより、全体的な効率が向上し、より安定した動作が期待されます。 • checkpointpostgresのバージョン2.0.23がリリースされた • checkpoint_blobsテーブルへの書き込みを減らすパフォーマンス改善が行われた • 依存関係のアップグレードが実施された • ドキュメント内の誤字が修正された • 全体的な効率と安定性の向上が期待される

langchain-ai/langgraph 2025/07/16

release tool

Google France hosted a hackathon to tackle healthcare's biggest challenges

Doctors, developers and researchers gathered in Paris to prototype new medical solutions using Google’s AI models.

Google AI Blog 2025/07/16

framework tool

How to support new VLMs into SGLang: A Case Study with NVILA

<p>The world of LLMs is evolving at a remarkable pace, with Visual Language Models (VLMs) at the forefront of this revolution. These models power application...

LMSYS Blog 2025/07/16

api cloud tool

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog 2025/07/16

tool

Amazon Bedrock Knowledge Bases now supports Amazon OpenSearch Service Managed Cluster as vector store

Amazon Bedrock Knowledge Bases has extended its vector store options by enabling support for Amazon OpenSearch Service managed clusters, further strengthening its capabilities as a fully managed Retrieval Augmented Generation (RAG) solution. This enhancement builds on the core functionality of Amazon Bedrock Knowledge Bases , which is designed to seamlessly connect foundation models (FMs) with internal data sources. This post provides a comprehensive, step-by-step guide on integrating an Amazon Bedrock knowledge base with an OpenSearch Service managed cluster as its vector store.

AWS Machine Learning Blog 2025/07/15

api tool

Monitor agents built on Amazon Bedrock with Datadog LLM Observability

We’re excited to announce a new integration between Datadog LLM Observability and Amazon Bedrock Agents that helps monitor agentic applications built on Amazon Bedrock. In this post, we'll explore how Datadog's LLM Observability provides the visibility and control needed to successfully monitor, operate, and debug production-grade agentic applications built on Amazon Bedrock Agents.

AWS Machine Learning Blog 2025/07/15

api tool

How PayU built a secure enterprise AI assistant using Amazon Bedrock

PayU offers a full-stack digital financial services system that serves the financial needs of merchants, banks, and consumers through technology. In this post, we explain how we equipped the PayU team with an enterprise AI solution and democratized AI access using Amazon Bedrock, without compromising on data residency requirements.

AWS Machine Learning Blog 2025/07/15

tool

Claude Connectors

Claude now connects to your favorite tools with one click. Browse and connect to Canva, Figma, Notion, Stripe, and more. Now Claude can see your projects, understand your deadlines, and work directly in your tools.

YouTube Anthropic 2025/07/15

langchain-core==0.3.69

この記事は、Langchainのコアライブラリのバージョン0.3.69のリリースに関するもので、主な変更点や新機能について説明しています。新機能として、デシリアライズをより許容的にするオプションが追加され、PipelinePromptTemplateの非推奨通知が文書に追加されました。また、BaseChatPromptTemplateの戻り値の型ヒントが修正され、クエリベクトルや埋め込みにNaN値が含まれている場合のエラーメッセージが追加されました。さらに、Ruffルールの追加やテストの改善も行われています。これらの変更は、Langchainの機能性と安定性を向上させることを目的としています。 • デシリアライズをより許容的にするオプションが追加された • PipelinePromptTemplateの非推奨通知が追加された • BaseChatPromptTemplateの戻り値の型ヒントが修正された • クエリベクトルや埋め込みにNaN値が含まれる場合のエラーメッセージが追加された • Ruffルールの追加やテストの改善が行われた

langchain-ai/langchain 2025/07/15

library release

No Image

Reflections on OpenAI

Calvin French-Owen spent just over a year working at OpenAI, during which time the organization grew from 1,000 to 3,000 people and Calvin found himself in "the top 30% by …

Simon Willison's Blog 2025/07/15

api library tool

What's an AI Agent?

What can you expect from AI agents? OpenAI COO Brad Lightcap explains on episode 3 of the OpenAI Podcast—out now.

YouTube OpenAI 2025/07/15

Brad Lightcap and Ronnie Chatterji on jobs, growth, and the AI economy — the OpenAI Podcast Ep. 3

The future of work is arriving faster than expected. In this episode, OpenAI COO Brad Lightcap and Chief Economist Ronnie Chatterji join Andrew Mayne to discuss the impacts of AI on software, science, small business, education, and jobs. 00:00 Intro 02:00 Birth of ChatGPT: from playground to product 06:15 AI’s impact on work & productivity 08:55 Supercharging science with AI 09:55 Small teams with big leverage 13:10 What sectors are next? 17:05 Defining AI agents 22:08 AI in emerging markets & agriculture 25:53 Return of the “Idea Guy” 28:20 Why EQ and soft skills matter 31:35 Education for the AI era 36:11 Partnering with Cal State & educators 39:14 From bans to buy-in in schools 42:00 Ronnie’s research: sectors, geography, communication 45:46 What should we tell our kids? 48:14 What history teaches us about disruption 52:04 Expanding participation in the economy 55:35 AI increases demand 59:19 Why OpenAI will grow after AGI 1:02:05 Favorite ChatGPT use cases

YouTube OpenAI 2025/07/15

The next wave of AI for content creation includes digital twins

AI and digital twins transform CPG marketing with scalable, cost-effective, personalized content creation. Learn more.

Microsoft AI Blog 2025/07/15

cloud tool

No Image

xAI: "We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated"

They continue: One was that if you ask it "What is your surname?" it doesn't have one so it searches the internet leading to undesirable results, such as when its …

Simon Willison's Blog 2025/07/15

tool

Supercharge generative AI workflows with NVIDIA DGX Cloud on AWS and Amazon Bedrock Custom Model Import

This post is co-written with Andrew Liu, Chelsea Isaac, Zoey Zhang, and Charlie Huang from NVIDIA. DGX Cloud on Amazon Web Services (AWS) represents a significant leap forward in democratizing access to high-performance AI infrastructure. By combining NVIDIA GPU expertise with AWS scalable cloud services, organizations can accelerate their time-to-train, reduce operational complexity, and unlock […]

AWS Machine Learning Blog 2025/07/15

cloud tool

Accelerate generative AI inference with NVIDIA Dynamo and Amazon EKS

This post introduces NVIDIA Dynamo and explains how to set it up on Amazon EKS for automated scaling and streamlined Kubernetes operations. We provide a hands-on walkthrough, which uses the NVIDIA Dynamo blueprint on the AI on EKS GitHub repo by AWS Labs to provision the infrastructure, configure monitoring, and install the NVIDIA Dynamo operator.

AWS Machine Learning Blog 2025/07/15

cloud tool

Moonshot AI's Kimi K2 model is now supported in Vercel AI Gateway

You can now access Kimi K2 from Moonshot AI using Vercel's AI Gateway, with no Moonshot AI account required.

Vercel Blog 2025/07/15

api cloud tool

AWS doubles investment in AWS Generative AI Innovation Center, marking two years of customer success

In this post, AWS announces a $100 million additional investment in its AWS Generative AI Innovation Center, marking two years of successful customer collaborations across industries from financial services to healthcare. The investment comes as AI evolves toward more autonomous, agentic systems, with the center already helping thousands of customers drive millions in productivity gains and transform customer experiences.

AWS Machine Learning Blog 2025/07/15

cloud tool

AI compliance: A core product competency you shouldn’t skip

AI governance is now a product feature. Learn how to embed trust, transparency, and compliance into your build cycles.

logrocket-dev 2025/07/15

api tool

Release v3.23.12

この記事は、RooCodeIncのGitHubリポジトリにおけるリリースv3.23.12について説明しています。このリリースは2025年7月15日に行われ、主にモデルパラメータにおけるmax-token計算の更新が含まれています。この更新は、Kimi K2などのモデルをより良くサポートすることを目的としています。リリースノートには、特定の変更点や改善点が記載されていますが、具体的な詳細は示されていません。 • リリースv3.23.12は2025年7月15日に行われた。 • max-token計算の更新が含まれている。 • この更新はKimi K2などのモデルをサポートするためのものである。 • 具体的な変更点や改善点はリリースノートに記載されている。

RooCodeInc/Roo-Code 2025/07/15

release tool

AWS のエージェント IDE Kiro を使ってみた

Kiro は AWS が開発した IDE 内蔵型の AI コーディングエージェントです。Kiro の特徴は単なるバイブコーディングにとどまらず、スペックを使用して仕様駆動開発でアプリケーションを開発できることです。この記事では Kiro を使ったアプリケーション開発の流れを紹介します。

azukiazusa のテックブログ2 2025/07/15

library tool

A summer of security: empowering cyber defenders with AI

Here’s what we’re announcing at cybersecurity conferences like Black Hat USA and DEF CON 33.

Google AI Blog 2025/07/15

security tool

Release v3.23.11

この記事は、RooCodeIncのGitHubリポジトリにおけるリリースv3.23.11について説明しています。このリリースでは、Kimi K2モデルがGroqに追加され、コンテキスト圧縮の数学に関する修正が行われました。また、前のモードに切り替えるためのCmd+Shift+.というキーボードショートカットも追加されています。リリース日は2025年7月14日で、GitHubの検証済み署名が付与されています。 • Kimi K2モデルがGroqに追加された • コンテキスト圧縮の数学に関する修正が行われた • Cmd+Shift+.のキーボードショートカットが追加された • リリース日は2025年7月14日 • GitHubの検証済み署名が付与されている

RooCodeInc/Roo-Code 2025/07/15

release tool

Intellectual freedom by design

ChatGPT is designed to be useful, trustworthy, and adaptable so you can make it your own.

OpenAI Blog 2025/07/15

tool

Application development without programmers

This book by James Martin, published in 1982, includes the following in the preface: Applications development did not change much for 20 years, but now a new wave is crashing …

Simon Willison's Blog 2025/07/14

api framework tool

Release v3.23.10

RooCodeIncのGitHubリポジトリで公開されたリリースv3.23.10は、2025年7月14日に行われたもので、主に2つの変更が含まれています。まず、組み込みモデルの次元をカスタム次元よりも優先するように変更されました。次に、インデックスモデルオプションにパディングが追加されました。これらの変更は、ユーザーからのフィードバックに基づいて行われたもので、特に@daniel-lxsによる貢献が挙げられています。 • 組み込みモデルの次元をカスタム次元より優先する変更 • インデックスモデルオプションにパディングを追加 • ユーザーからのフィードバックに基づく改善 • @daniel-lxsによる貢献が含まれている

RooCodeInc/Roo-Code 2025/07/14

release tool

0.5.3

この記事は、GitHub上のlangchain-ai/langgraphリポジトリのバージョン0.5.3のリリースノートについて説明しています。このリリースでは、依存関係のアップグレード、ドキュメントの修正、いくつかのバグ修正が行われました。具体的には、PregelProtocolに関するABC仕様の削除、StateGraphへのアクセス時の_state_schemaの置き換え、PostgresチェックポイントにおけるPythonの無効なエスケープ警告の削除が含まれています。また、READMEにフォーラムへのリンクが追加されました。 • 依存関係のアップグレードが行われた • PregelProtocolに関するABC仕様が削除された • StateGraphへのアクセス時に_state_schemaがstate_schemaに置き換えられた • PostgresチェックポイントでのPythonの無効なエスケープ警告が修正された • READMEにフォーラムへのリンクが追加された

langchain-ai/langgraph 2025/07/14

release tool

Release v3.23.9

RooCodeIncのリリースv3.23.9では、Claude CodeプロバイダーがWindows上でネイティブに動作するように対応し、コマンド実行のための設定可能なタイムアウトが追加されました。また、code-indexサービスにgemini-embedding-001モデルが追加され、埋め込みモデルを切り替える際のベクトル次元不一致エラーが解決されました。さらに、execツールの応答に現在の作業ディレクトリ(cwd)が返されるようになり、後続の呼び出しでモデルが失われないように改善されています。 • Claude CodeプロバイダーがWindowsでネイティブに動作するように対応 • コマンド実行のための設定可能なタイムアウトが追加された • code-indexサービスにgemini-embedding-001モデルが追加された • 埋め込みモデルを切り替える際のベクトル次元不一致エラーが解決された • execツールの応答に現在の作業ディレクトリ(cwd)が返されるようになった

RooCodeInc/Roo-Code 2025/07/14

release tool

Discover tools that work with Claude

We launched a new directory of tools that connect directly to Claude. Connect Claude to Notion, Canva, Figma, Stripe, and more in one click. Claude can see your projects, understand your deadlines, and work directly in your tools.

YouTube Anthropic 2025/07/14

No Image

ccusage

Claude Code logs detailed usage information to the ~/.claude/ directory. ccusage is a neat little Node.js tool which reads that information and shows you a readable summary of your usage …

Simon Willison's Blog 2025/07/14

tool

Build AI-driven policy creation for vehicle data collection and automation using Amazon Bedrock

Sonatus partnered with the AWS Generative AI Innovation Center to develop a natural language interface to generate data collection and automation policies using generative AI. This innovation aims to reduce the policy generation process from days to minutes while making it accessible to both engineers and non-experts alike. In this post, we explore how we built this system using Sonatus’s Collector AI and Amazon Bedrock. We discuss the background, challenges, and high-level solution architecture.

AWS Machine Learning Blog 2025/07/14

api tool

How Rapid7 automates vulnerability risk scores with ML pipelines using Amazon SageMaker AI

In this post, we share how Rapid7 implemented end-to-end automation for the training, validation, and deployment of ML models that predict CVSS vectors. Rapid7 customers have the information they need to accurately understand their risk and prioritize remediation measures.

AWS Machine Learning Blog 2025/07/14

api tool

Build secure RAG applications with AWS serverless data lakes

In this post, we explore how to build a secure RAG application using serverless data lake architecture, an important data strategy to support generative AI development. We use Amazon Web Services (AWS) services including Amazon S3, Amazon DynamoDB, AWS Lambda, and Amazon Bedrock Knowledge Bases to create a comprehensive solution supporting unstructured data assets which can be extended to structured data. The post covers how to implement fine-grained access controls for your enterprise data and design metadata-driven retrieval systems that respect security boundaries. These approaches will help you maximize the value of your organization's data while maintaining robust security and compliance.

AWS Machine Learning Blog 2025/07/14

api cloud security

langchain-openai==0.3.28

この記事は、langchain-openaiのバージョン0.3.28のリリースに関する情報を提供しています。このリリースでは、OpenAIに関連するいくつかの修正と更新が行われました。具体的には、コンピュータ使用時の安全性チェックをサポートする修正や、SDKのバージョンアップ、Grok 4に関するドキュメントの更新が含まれています。また、コードの品質向上のためにruffのルールが追加され、問題を自動的に修正する機能も実装されています。これにより、開発者はより安全で効率的なコーディングが可能になります。 • OpenAIに関連する安全性チェックのサポートが追加された。 • SDKのバージョンが更新された。 • Grok 4に関するドキュメントが更新された。 • ruffのルールが追加され、コードの品質が向上した。 • 問題を自動的に修正する機能が実装された。

langchain-ai/langchain 2025/07/14

release tool

Cost Effective Deployment of DeepSeek R1 with Intel® Xeon® 6 CPU on SGLang

<p>The impressive performance of DeepSeek R1 marked a rise of giant Mixture of Experts (MoE) models in Large Language Models (LLM). However, its massive mode...

LMSYS Blog 2025/07/14

library tool

Release v3.23.8

RooCodeIncのGitHubリポジトリで公開されたリリースv3.23.8では、コードインデックスの有効/無効トグル機能が追加され、コマンドの自動承認設定に自動拒否リストが追加されました。また、履歴プレビューの履歴タブへのナビゲーションリンクも追加されています。これにより、ユーザーはコードのインデックス管理や履歴の確認がより便利になります。 • コードインデックスの有効/無効トグル機能の追加 • コマンドの自動承認設定に自動拒否リストの追加 • 履歴プレビューの履歴タブへのナビゲーションリンクの追加 • ユーザーの利便性向上

RooCodeInc/Roo-Code 2025/07/13

release tool

🥇Top AI Papers of the Week

The Top AI Papers of the Week (July 7 - 13)

Elvis Saravia's NLP Blog 2025/07/13

platform

サンドボックス環境を MCP サーバーで提供する Container Use

AI コーディングエージェントは便利ですが、任意の Bash コマンドを実行できるため、ユーザーのシステムに影響を与える可能性があります。Container Use は MCP サーバーとして動作し、AI コーディングエージェントにサンドボックス環境を提供します。この記事では Container Use の利用方法について紹介します。

azukiazusa のテックブログ2 2025/07/13

api tool

🤖 AI Agents Weekly: Grok 4, Context Engineering Guide, Kimi K2, SmolLM3, MedGemma 27B, AI SDK 5

Grok 4, Context Engineering Guide, Kimi K2, SmolLM3, MedGemma 27B, AI SDK 5

Elvis Saravia's NLP Blog 2025/07/12

platform

Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity

METR - for Model Evaluation & Threat Research - are a non-profit research institute founded by Beth Barnes, a former alignment researcher at OpenAI (see Wikipedia). They've previously contributed to …

Simon Willison's Blog 2025/07/12

tool

Grok 4 Heavy won't reveal its system prompt

Grok 4 Heavy is the "think much harder" version of Grok 4 that's currenly only available on their $300/month plan. Jeremy Howard relays a report from a Grok 4 Heavy …

Simon Willison's Blog 2025/07/12

platform

No Image

Quoting @grok

On the morning of July 8, 2025, we observed undesired responses and immediately began investigating. To identify the specific language in the instructions causing the undesired behavior, we conducted multiple …

Simon Willison's Blog 2025/07/12

platform

Release v3.23.7

この記事は、RooCodeIncのRoo-Codeリポジトリのバージョン3.23.7のリリースノートを提供しています。このリリースでは、Mermaid構文の警告修正、GCPのVertex AIのすべての利用可能なリージョンを含むように設定を拡張、埋め込みモデルの切り替え時にQdrantベクトルの次元不一致を処理する機能が追加されました。また、コメントやドキュメントの誤字修正、コードベース検索結果の表示改善、埋め込みエラーの翻訳フォールバックロジックの修正、MCPツールの無効化のクリーンアップ、モードとMCPタブからのマーケットプレイスへのリンク追加、TTSボタンの表示修正、Devstral Mediumモデルのサポート追加、コードインデックスサービスへの包括的なエラーテレメトリの追加、コンテキストウィンドウ計算からキャッシュトークンを除外する機能、コンテキスト発見のためのアーキテクトモードでの動的ツール選択の有効化、Claudeコード用の最大出力トークン設定の構成可能化が行われました。 • Mermaid構文の警告を修正 • GCP Vertex AIのすべてのリージョンを含む設定を拡張 • 埋め込みモデル切り替え時のQdrantベクトル次元不一致を処理 • コメントやドキュメントの誤字を修正 • コードベース検索結果の表示を改善 • 埋め込みエラーの翻訳フォールバックロジックを修正 • MCPツールの無効化をクリーンアップ • モードとMCPタブからマーケットプレイスへのリンクを追加 • TTSボタンの表示を修正 • Devstral Mediumモデルのサポートを追加 • コードインデックスサービスにエラーテレメトリを追加 • コンテキストウィンドウ計算からキャッシュトークンを除外 • アーキテクトモードでの動的ツール選択を有効化 • Claudeコード用の最大出力トークン設定を構成可能に

RooCodeInc/Roo-Code 2025/07/12

release tool

No Image

Musk’s latest Grok chatbot searches for billionaire mogul’s views before answering questions

I got quoted a couple of times in this story about Grok searching for tweets from:elonmusk by Matt O’Brien for the Associated Press. “It’s extraordinary,” said Simon Willison, an independent …

Simon Willison's Blog 2025/07/12

tool

moonshotai/Kimi-K2-Instruct

Colossal new open weights model release today from Moonshot AI, a two year old Chinese AI lab with a name inspired by Pink Floyd’s album The Dark Side of the …

Simon Willison's Blog 2025/07/11

tool

Advanced fine-tuning methods on Amazon SageMaker AI

When fine-tuning ML models on AWS, you can choose the right tool for your specific needs. AWS provides a comprehensive suite of tools for data scientists, ML engineers, and business users to achieve their ML goals. AWS has built solutions to support various levels of ML sophistication, from simple SageMaker training jobs for FM fine-tuning to the power of SageMaker HyperPod for cutting-edge research. We invite you to explore these options, starting with what suits your current needs, and evolve your approach as those needs change.

AWS Machine Learning Blog 2025/07/11

api cloud tool

Streamline machine learning workflows with SkyPilot on Amazon SageMaker HyperPod

This post is co-written with Zhanghao Wu, co-creator of SkyPilot. The rapid advancement of generative AI and foundation models (FMs) has significantly increased computational resource requirements for machine learning (ML) workloads. Modern ML pipelines require efficient systems for distributing workloads across accelerated compute resources, while making sure developer productivity remains high. Organizations need infrastructure solutions […]

AWS Machine Learning Blog 2025/07/11

cloud tool

No Image

Quoting Django’s security policies

Following the widespread availability of large language models (LLMs), the Django Security Team has received a growing number of security reports generated partially or entirely using such tools. Many of …

Simon Willison's Blog 2025/07/11

security

Intelligent document processing at scale with generative AI and Amazon Bedrock Data Automation

This post presents an end-to-end IDP application powered by Amazon Bedrock Data Automation and other AWS services. It provides a reusable AWS infrastructure as code (IaC) that deploys an IDP pipeline and provides an intuitive UI for transforming documents into structured tables at scale. The application only requires the user to provide the input documents (such as contracts or emails) and a list of attributes to be extracted. It then performs IDP with generative AI.

AWS Machine Learning Blog 2025/07/11

api tool

Build a conversational data assistant, Part 2 – Embedding generative business intelligence with Amazon Q in QuickSight

In this post, we dive into how we integrated Amazon Q in QuickSight to transform natural language requests like “Show me how many items were returned in the US over the past 6 months” into meaningful data visualizations. We demonstrate how combining Amazon Bedrock Agents with Amazon Q in QuickSight creates a comprehensive data assistant that delivers both SQL code and visual insights through a single, intuitive conversational interface—democratizing data access across the enterprise.

AWS Machine Learning Blog 2025/07/11

api tool

Build a conversational data assistant, Part 1: Text-to-SQL with Amazon Bedrock Agents

In this post, we focus on building a Text-to-SQL solution with Amazon Bedrock, a managed service for building generative AI applications. Specifically, we demonstrate the capabilities of Amazon Bedrock Agents. Part 2 explains how we extended the solution to provide business insights using Amazon Q in QuickSight, a business intelligence assistant that answers questions with auto-generated visualizations.

AWS Machine Learning Blog 2025/07/11

api tool

Implement user-level access control for multi-tenant ML platforms on Amazon SageMaker AI

In this post, we discuss permission management strategies, focusing on attribute-based access control (ABAC) patterns that enable granular user access control while minimizing the proliferation of AWS Identity and Access Management (IAM) roles. We also share proven best practices that help organizations maintain security and compliance without sacrificing operational efficiency in their ML workflows.

AWS Machine Learning Blog 2025/07/11

api tool

Long-running execution flows now supported in Amazon Bedrock Flows in public preview

We announce the public preview of long-running execution (asynchronous) flow support within Amazon Bedrock Flows. With Amazon Bedrock Flows, you can link foundation models (FMs), Amazon Bedrock Prompt Management, Amazon Bedrock Agents, Amazon Bedrock Knowledge Bases, Amazon Bedrock Guardrails, and other AWS services together to build and scale predefined generative AI workflows.

AWS Machine Learning Blog 2025/07/11

api tool

Fraud detection empowered by federated learning with the Flower framework on Amazon SageMaker AI

In this post, we explore how SageMaker and federated learning help financial institutions build scalable, privacy-first fraud detection systems.

AWS Machine Learning Blog 2025/07/11

framework tool

Building intelligent AI voice agents with Pipecat and Amazon Bedrock – Part 2

In Part 1 of this series, you learned how you can use the combination of Amazon Bedrock and Pipecat, an open source framework for voice and multimodal conversational AI agents to build applications with human-like conversational AI. You learned about common use cases of voice agents and the cascaded models approach, where you orchestrate several components to build your voice AI agent. In this post (Part 2), you explore how to use speech-to-speech foundation model, Amazon Nova Sonic, and the benefits of using a unified model.

AWS Machine Learning Blog 2025/07/11

api tool

Uphold ethical standards in fashion using multimodal toxicity detection with Amazon Bedrock Guardrails

In the fashion industry, teams are frequently innovating quickly, often utilizing AI. Sharing content, whether it be through videos, designs, or otherwise, can lead to content moderation challenges. There remains a risk (through intentional or unintentional actions) of inappropriate, offensive, or toxic content being produced and shared. In this post, we cover the use of the multimodal toxicity detection feature of Amazon Bedrock Guardrails to guard against toxic content. Whether you’re an enterprise giant in the fashion industry or an up-and-coming brand, you can use this solution to screen potentially harmful content before it impacts your brand’s reputation and ethical standards. For the purposes of this post, ethical standards refer to toxic, disrespectful, or harmful content and images that could be created by fashion designers.

AWS Machine Learning Blog 2025/07/11

api tool

langchain-groq==0.3.6

この記事は、langchain-groqのバージョン0.3.6のリリースに関する情報を提供しています。このリリースでは、Grok 4に関するドキュメントの更新や、ロックファイルのバンプ、ruffによるスタックレベルの復元、オートフィックスの無効化、バグベアの追加、パッケージ全体にわたるルールの追加と修正が行われています。また、ChatGroqにサービスティアオプションが追加されました。これらの変更は、コードの品質向上や機能の拡張を目的としています。 • Grok 4に関するドキュメントが更新された • ロックファイルがバンプされた • ruffによるスタックレベルの復元とオートフィックスの無効化が行われた • バグベアがパッケージ全体に追加された • ChatGroqにサービスティアオプションが追加された

langchain-ai/langchain 2025/07/11

release tool

Patch Release v4.53.2

この記事は、Hugging FaceのTransformersライブラリのバージョン4.53.2のパッチリリースについて説明しています。このリリースには、いくつかのバグ修正が含まれています。具体的には、GLM-4.1Vモデルのファインチューニングとバッチ推論に関するバグの修正、Ascend NPUでのフラッシュアテンション2のエラー修正、GLM4.1vモデルのトレーニング時のエラー修正、ページアテンション生成におけるオフバイワンエラーの修正、smollm3用のトークナイザーマッピングの追加、スライディングウィンドウ機能のリバートと非推奨化、GLM4vのバッチビデオフォワードの修正、マスキングユーティリティにおけるposition_idsのデフォルト値の追加が含まれています。 • GLM-4.1Vモデルのファインチューニングとバッチ推論に関するバグ修正 • Ascend NPUでのフラッシュアテンション2のエラー修正 • GLM4.1vモデルのトレーニング時のエラー修正 • ページアテンション生成におけるオフバイワンエラーの修正 • smollm3用のトークナイザーマッピングの追加 • スライディングウィンドウ機能のリバートと非推奨化 • GLM4vのバッチビデオフォワードの修正 • マスキングユーティリティにおけるposition_idsのデフォルト値の追加

huggingface/transformers 2025/07/11

library release

GitHub Copilot NESの内部実装が公開、そして続・AIエディタ戦争

Copilot NESとは Copilot NES（Next Edit Suggestions）は2025年2月にリリースされたGitHub Copilotの内部機能です。コードの変更に連動して必要となる次の編集を予測し、タブキーを押しているだけで複数箇所にわたる修正を提案してくれます。通常のコード補完がカーソル位置の続きのコードを予測するのに対して、Copilot NESは「エディタ上の編集操作」の単位で続きを予測して補完します。 GitHub Next | Copilot Next Edit SuggestionsGitHub Next Project: Can we improve Copilot code completion by suggesting the next logical change, wherever it is in your project?GitHub Next この仕組みはCopilot NESの元ネタであるCursor Tab(Copilot++)によって実用化されましたが、Cursorはプロプライエタリなソフトウェアなので内部の詳細が分かり

Lai.so Blog 2025/07/11

library tool

The EU Code of Practice and future of AI in Europe

OpenAI joins the EU Code of Practice, advancing responsible AI while partnering with European governments to drive innovation, infrastructure, and economic growth.

OpenAI Blog 2025/07/11

tool

No Image

Generationship: Ep. #39, Simon Willison

I recorded this podcast episode with Rachel Chalmers a few weeks ago. We talked about the resurgence of blogging, the legacy of Google Reader, learning in public, LLMs as weirdly …

Simon Willison's Blog 2025/07/11

podcast

Grok: searching X for "from:elonmusk (Israel OR Palestine OR Hamas OR Gaza)"

If you ask the new Grok 4 for opinions on controversial questions, it will sometimes run a search to find out Elon Musk’s stance before providing you with an answer. …

Simon Willison's Blog 2025/07/11

platform

checkpointpostgres==2.0.22

この記事は、GitHub上のlangchain-ai/langgraphリポジトリにおけるcheckpointpostgresのバージョン2.0.22のリリースに関する内容です。このリリースでは、いくつかのバグ修正や依存関係のアップグレードが行われています。具体的には、Pythonの無効なエスケープ警告の削除、内部ツールのロックファイルの更新、カスタムチェックポインタクラスとの互換性の復元、Pandasのシリアライズにおけるピクルフォールバックのサポート、NumPy配列のシリアライズのサポートなどが含まれています。また、PostgresSaverの接続要件に関するドキュメントの強化や、コードのリファクタリングも行われています。 • バージョン2.0.22のリリースに伴うバグ修正と依存関係のアップグレード • Pythonの無効なエスケープ警告を削除 • カスタムチェックポインタクラスとの互換性を復元 • Pandasのシリアライズにピクルフォールバックをサポート • NumPy配列のシリアライズをサポート • PostgresSaverの接続要件に関するドキュメントを強化 • コードのリファクタリングを実施

langchain-ai/langgraph 2025/07/10

library release tool

Grok 4

Released last night, Grok 4 is now available via both API and a paid subscription for end-users. Key characteristics: image and text input, text output. 256,000 context length (twice that …

Simon Willison's Blog 2025/07/10

api tool

New capabilities in Amazon SageMaker AI continue to transform how organizations develop AI models

In this post, we share some of the new innovations in SageMaker AI that can accelerate how you build and train AI models. These innovations include new observability capabilities in SageMaker HyperPod, the ability to deploy JumpStart models on HyperPod, remote connections to SageMaker AI from local development environments, and fully managed MLflow 3.0.

AWS Machine Learning Blog 2025/07/10

api cloud tool

Accelerate foundation model development with one-click observability in Amazon SageMaker HyperPod

With a one-click installation of the Amazon Elastic Kubernetes Service (Amazon EKS) add-on for SageMaker HyperPod observability, you can consolidate health and performance data from NVIDIA DCGM, instance-level Kubernetes node exporters, Elastic Fabric Adapter (EFA), integrated file systems, Kubernetes APIs, Kueue, and SageMaker HyperPod task operators. In this post, we walk you through installing and using the unified dashboards of the out-of-the-box observability feature in SageMaker HyperPod. We cover the one-click installation from the Amazon SageMaker AI console, navigating the dashboard and metrics it consolidates, and advanced topics such as setting up custom alerts.

AWS Machine Learning Blog 2025/07/10

api cloud tool

Accelerating generative AI development with fully managed MLflow 3.0 on Amazon SageMaker AI

In this post, we explore how Amazon SageMaker now offers fully managed support for MLflow 3.0, streamlining AI experimentation and accelerating your generative AI journey from idea to production. This release transforms managed MLflow from experiment tracking to providing end-to-end observability, reducing time-to-market for generative AI development.

AWS Machine Learning Blog 2025/07/10

api cloud tool

Amazon SageMaker HyperPod launches model deployments to accelerate the generative AI model development lifecycle

In this post, we announce Amazon SageMaker HyperPod support for deploying foundation models from SageMaker JumpStart, as well as custom or fine-tuned models from Amazon S3 or Amazon FSx. This new capability allows customers to train, fine-tune, and deploy models on the same HyperPod compute resources, maximizing resource utilization across the entire model lifecycle.

AWS Machine Learning Blog 2025/07/10

api cloud tool

Supercharge your AI workflows by connecting to SageMaker Studio from Visual Studio Code

AI developers and machine learning (ML) engineers can now use the capabilities of Amazon SageMaker Studio directly from their local Visual Studio Code (VS Code). With this capability, you can use your customized local VS Code setup, including AI-assisted development tools, custom extensions, and debugging tools while accessing compute resources and your data in SageMaker Studio. In this post, we show you how to remotely connect your local VS Code to SageMaker Studio development environments to use your customized development environment while accessing Amazon SageMaker AI compute resources.

AWS Machine Learning Blog 2025/07/10

cloud tool

It speaks!

Steps to access different voice options: 1️⃣ Tap the voice mode button (wave icon next to microphone) in the lower right. 2️⃣ Once voice mode opens up, click the voice selector in the upper right of the screen (it kind of looks like a filter icon and is the one on the very right). 3️⃣ Swipe through the different voices to find your favorite.

YouTube OpenAI 2025/07/10

Use K8sGPT and Amazon Bedrock for simplified Kubernetes cluster maintenance

This post demonstrates the best practices to run K8sGPT in AWS with Amazon Bedrock in two modes: K8sGPT CLI and K8sGPT Operator. It showcases how the solution can help SREs simplify Kubernetes cluster management through continuous monitoring and operational intelligence.

AWS Machine Learning Blog 2025/07/10

cloud tool

How Rocket streamlines the home buying experience with Amazon Bedrock Agents

Rocket AI Agent is more than a digital assistant. It’s a reimagined approach to client engagement, powered by agentic AI. By combining Amazon Bedrock Agents with Rocket’s proprietary data and backend systems, Rocket has created a smarter, more scalable, and more human experience available 24/7, without the wait. This post explores how Rocket brought that vision to life using Amazon Bedrock Agents, powering a new era of AI-driven support that is consistently available, deeply personalized, and built to take action.

AWS Machine Learning Blog 2025/07/10

api tool

Build an MCP application with Mistral models on AWS

This post demonstrates building an intelligent AI assistant using Mistral AI models on AWS and MCP, integrating real-time location services, time data, and contextual memory to handle complex multimodal queries. This use case, restaurant recommendations, serves as an example, but this extensible framework can be adapted for enterprise use cases by modifying MCP server configurations to connect with your specific data sources and business systems.

AWS Machine Learning Blog 2025/07/10

cloud tool

Build real-time conversational AI experiences using Amazon Nova Sonic and LiveKit

mazon Nova Sonic is now integrated with LiveKit’s WebRTC framework, a widely used platform that enables developers to build real-time audio, video, and data communication applications. This integration makes it possible for developers to build conversational voice interfaces without needing to manage complex audio pipelines or signaling protocols. In this post, we explain how this integration works, how it addresses the historical challenges of voice-first applications, and some initial steps to start using this solution.

AWS Machine Learning Blog 2025/07/10

api tool

Gemini CLI tutorial — Will it replace Windsurf and Cursor?

Discover how to use Gemini CLI, Google's new open-source AI agent that brings Gemini directly to your terminal.

logrocket-dev 2025/07/10

api tool

Grok 4がリリース

xAIのGrok 4が公開されました。 Introducing Grok 4, the world's most powerful AI model. Watch the livestream now: https://t.co/59iDX5s2ck — xAI (@xai) July 10, 2025 モデルカードコンテキストウィンドウは256,000トークンです。Claude 4 Sonnetが200,000トークン。 Models / Grok 4 「Grok 4 Code」って何なのコーディングモデルの名前です。Claude Code的なCLIではなさそうです。OpenAIでいうCodex（モデルの方）になります。Redditのスレによると「Cursorで使える」というメッセージがコンソールにでていたらしいです。 Grok 4 by

Lai.so Blog 2025/07/10

api tool

The AI Cloud: A unified platform for AI workloads

We made it simple to build, preview, and ship any frontend, from marketing pages to dynamic apps, without managing infrastructure. Now we’re introducing the next layer: the Vercel AI Cloud.

Vercel Blog 2025/07/10

api cloud tool

Stress-testing AI products: A red-teaming playbook

Red-teaming reveals how AI fails at scale. Learn to embed adversarial testing into your sprints before your product becomes a headline.

logrocket-dev 2025/07/10

api tool

What Is AI Sentiment Analysis and How to Build It with n8n?

Learn how to use AI sentiment analysis in n8n to intelligently automate workflows. Explore sentiment types, real-world use cases, and step-by-step guidance to build an agent-based system that classifies and explains email intent.

n8n Blog 2025/07/10

api tool

Release v3.23.6

この記事は、RooCodeIncのGitHubリポジトリにおけるリリースv3.23.6についての情報を提供しています。このリリースは2025年7月10日に行われ、特定のコミット（39ab006）が含まれています。リリースノートには、タグの読み込みに関するエラーが発生したことが記載されていますが、具体的な変更点や新機能についての詳細は示されていません。リリースはGitHubの検証済み署名で作成されており、ユーザーは通知設定を変更するためにサインインする必要があります。 • リリースバージョンはv3.23.6である • リリース日は2025年7月10日である • 特定のコミットID（39ab006）が含まれている • リリースノートにはタグの読み込みエラーが記載されている • GitHubの検証済み署名で作成されている

RooCodeInc/Roo-Code 2025/07/10

release tool

Grok 4 の発表まとめ＆試してみた

Zenn schroneko 2025/07/10

api tool

Leader Spotlight: Building a human-focused AI product, with Cory Bishop

Cory Bishop talks about the role of human-centered design and empathy in Bubble’s no-code AI development product.

logrocket-dev 2025/07/10

tool

How to Build an Agent

Learn how to build an agent -- from choosing realistic task examples, to building the MVP to testing quality and safety, to deploying in production.

LangChain Blog 2025/07/10

api tool

Building the Hugging Face MCP Server

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog 2025/07/10

api cloud tool

Release v3.23.5

RooCodeIncのGitHubリポジトリで公開されたリリースv3.23.5では、いくつかの修正が行われました。具体的には、openFile関数内でdecodeURIComponentを使用する修正が含まれています。また、エラーメッセージをUIに送信する前に翻訳する修正も行われました。さらに、アカウントタブが表示されるようになりました。これらの修正は、ユーザーエクスペリエンスの向上を目的としています。 • openFile関数でdecodeURIComponentを使用する修正 • エラーメッセージをUIに送信する前に翻訳する修正 • アカウントタブが表示されるようになった

RooCodeInc/Roo-Code 2025/07/09

release tool

Release v3.23.4

RooCodeIncのGitHubリポジトリで公開されたリリースv3.23.4では、チャットエリアのアイコンが改善され、より発見しやすく一貫性のあるデザインが実現されました。また、.gitignoreによって除外されるべきディレクトリ結果を返すlist_filesのバグが修正され、UIを整えるためのオーバーフローヘッダーメニューが追加されました。さらに、nullのカスタムモード設定ファイルによって発生する「Cannot read properties of null」エラーの修正や、ネイティブタイトル属性をStandardTooltipコンポーネントに置き換えることで一貫性が向上しました。 • チャットエリアのアイコンが改善され、発見しやすくなった • list_filesのバグが修正され、.gitignoreによる除外が適切に行われるようになった • UIを整えるためのオーバーフローヘッダーメニューが追加された • nullのカスタムモード設定ファイルによるエラーが修正された • ネイティブタイトル属性がStandardTooltipコンポーネントに置き換えられ、一貫性が向上した

RooCodeInc/Roo-Code 2025/07/09

release tool

AWS AI infrastructure with NVIDIA Blackwell: Two powerful compute solutions for the next frontier of AI

In this post, we announce general availability of Amazon EC2 P6e-GB200 UltraServers and P6-B200 instances, powered by NVIDIA Blackwell GPUs, designed for training and deploying the largest, most sophisticated AI models.

AWS Machine Learning Blog 2025/07/09

cloud infra tool

Unlock retail intelligence by transforming data into actionable insights using generative AI with Amazon Q Business

Amazon Q Business for Retail Intelligence is an AI-powered assistant designed to help retail businesses streamline operations, improve customer service, and enhance decision-making processes. This solution is specifically engineered to be scalable and adaptable to businesses of various sizes, helping them compete more effectively. In this post, we show how you can use Amazon Q Business for Retail Intelligence to transform your data into actionable insights.

AWS Machine Learning Blog 2025/07/09

api cloud tool

0.5.2

この記事は、GitHub上のlangchain-ai/langgraphリポジトリのバージョン0.5.2のリリースノートについて説明しています。このリリースは2023年7月9日に行われ、主にバージョン0.5.1からのパッチが含まれています。具体的には、invokeおよびstreamに関するヒントの修正が行われ、CommandとNoneの両方を許可するようになりました。これにより、ユーザーはより柔軟にコマンドを使用できるようになります。 • バージョン0.5.2は2023年7月9日にリリースされた。 • 主な変更点は、invokeおよびstreamに関するヒントの修正である。 • 修正により、CommandとNoneの両方が許可されるようになった。 • この変更はユーザーにとっての柔軟性を向上させる。

langchain-ai/langgraph 2025/07/09

release tool

Infinite Monkey

Mihai Parparita's Infinite Mac lets you run classic MacOS emulators directly in your browser. Infinite Monkey is a new feature which taps into the OpenAI Computer Use and Claude Computer …

Simon Willison's Blog 2025/07/09

tool

2025-07-08

この記事は、Mastraの2025年7月8日のリリースに関する情報を提供しています。MastraはApache-2.0ライセンスの下で提供され、マルチモーダルプレイグラウンドが利用可能になりました。CLI/Playgroundに関する重要な修正や機能追加が行われ、特にデバッグを可能にするための'--inspect'フラグのサポートが追加されました。また、ワークフローに'イベント送信'機能が追加され、クライアントSDKにはabortSignalオプションがサポートされました。さらに、Google Geminiモデルとの互換性を向上させるためのZodNullスキーマの処理がサポートされ、メモリ管理やストレージの改善も行われました。 • MastraはApache-2.0ライセンスで提供される。 • マルチモーダルプレイグラウンドが利用可能になった。 • デバッグを可能にする'--inspect'フラグがCLIに追加された。 • ワークフローに'イベント送信'機能が追加された。 • クライアントSDKにabortSignalオプションが追加された。 • Google Geminiモデルとの互換性を向上させるためのZodNullスキーマの処理がサポートされた。 • メモリ管理やストレージの改善が行われた。

mastra-ai/mastra 2025/07/09

release tool

June 2025 (version 1.102)

Learn what is new in the Visual Studio Code June 2025 Release (1.102)

VS Code Blog 2025/07/09

api library tool

MedGemma: Our most capable open models for health AI development

Google Research 2025/07/09

api cloud tool

Democratize data for timely decisions with text-to-SQL at Parcel Perform

The business team in Parcel Perform often needs access to data to answer questions related to merchants’ parcel deliveries, such as “Did we see a spike in delivery delays last week? If so, in which transit facilities were this observed, and what was the primary cause of the issue?” Previously, the data team had to manually form the query and run it to fetch the data. With the new generative AI-powered text-to-SQL capability in Parcel Perform, the business team can self-serve their data needs by using an AI assistant interface. In this post, we discuss how Parcel Perform incorporated generative AI, data storage, and data access through AWS services to make timely decisions.

AWS Machine Learning Blog 2025/07/09

api tool

Query Amazon Aurora PostgreSQL using Amazon Bedrock Knowledge Bases structured data

In this post, we discuss how to make your Amazon Aurora PostgreSQL-Compatible Edition data available for natural language querying through Amazon Bedrock Knowledge Bases while maintaining data freshness.

AWS Machine Learning Blog 2025/07/09

api cloud tool

Configure fine-grained access to Amazon Bedrock models using Amazon SageMaker Unified Studio

In this post, we demonstrate how to use SageMaker Unified Studio and AWS Identity and Access Management (IAM) to establish a robust permission framework for Amazon Bedrock models. We show how administrators can precisely manage which users and teams have access to specific models within a secure, collaborative environment. We guide you through creating granular permissions to control model access, with code examples for common enterprise governance scenarios.

AWS Machine Learning Blog 2025/07/09

tool

Improve conversational AI response times for enterprise applications with the Amazon Bedrock streaming API and AWS AppSync

This post demonstrates how integrating an Amazon Bedrock streaming API with AWS AppSync subscriptions significantly enhances AI assistant responsiveness and user satisfaction. By implementing this streaming approach, the global financial services organization reduced initial response times for complex queries by approximately 75%—from 10 seconds to just 2–3 seconds—empowering users to view responses as they’re generated rather than waiting for complete answers.

AWS Machine Learning Blog 2025/07/09

api cloud tool

Scale generative AI use cases, Part 1: Multi-tenant hub and spoke architecture using AWS Transit Gateway

n this two-part series, we discuss a hub and spoke architecture pattern for building a multi-tenant and multi-account architecture. This pattern supports abstractions for shared services across use cases and teams, helping create secure, scalable, and reliable generative AI systems. In Part 1, we present a centralized hub for generative AI service abstractions and tenant-specific spokes, using AWS Transit Gateway for cross-account interoperability.

AWS Machine Learning Blog 2025/07/09

api cloud security

Reasoning reimagined: Introducing Phi-4-mini-flash-reasoning

Unlock faster, efficient reasoning with Phi-4-mini-flash-reasoning—optimized for edge, mobile, and real-time applications.

Microsoft AI Blog 2025/07/09

framework tool

Release v3.23.3

RooCodeIncのGitHubリポジトリでリリースされたバージョン3.23.3は、2025年7月9日に公開されました。このリリースでは、アナウンスモーダルから誤った行が削除されました。リリースノートには、特に新機能や改善点についての詳細は記載されていませんが、リリースの署名はGitHubの検証済み署名で行われています。 • リリースバージョンは3.23.3である • リリース日は2025年7月9日 • アナウンスモーダルから誤った行が削除された • リリースはGitHubの検証済み署名で行われた

RooCodeInc/Roo-Code 2025/07/09

release tool

Dive deeper with AI Mode and get gaming help in Circle to Search

We’re bringing new AI capabilities to Circle to Search, so you can dive deeper and ask follow-ups in AI Mode, and get gaming tips.

Google AI Blog 2025/07/09

tool

Release v3.23.2

RooCodeIncのRoo-Codeリポジトリで、バージョン3.23.2がリリースされました。このリリースは2025年7月9日に行われ、主に自動承認機能に関するバグ修正が含まれています。具体的には、自動承認が時折失敗する問題が修正されました。リリースはGitHub上で行われ、コミットはGitHubの検証済み署名で作成されています。 • バージョン3.23.2がリリースされた • リリース日は2025年7月9日 • 自動承認機能のバグが修正された • 自動承認が時折失敗する問題が解決された • コミットはGitHubの検証済み署名で作成された

RooCodeInc/Roo-Code 2025/07/09

release tool

Inngest joins the Vercel Marketplace

Build background jobs and AI workflows with Inngest, now on the Vercel Marketplace. Native support for Next.js, preview environments, and branching.

Vercel Blog 2025/07/09

api tool

Release v3.23.1

RooCodeIncのGitHubリポジトリで公開されたリリースv3.23.1は、2025年7月9日に行われたものである。このリリースでは、チャットテキストエリアの下にコードインデックスのドットを常に表示する機能が追加された。リリースノートには、特に新機能や修正点の詳細は記載されていないが、GitHubの署名付きコミットとして確認されている。 • リリースv3.23.1は2025年7月9日に公開された • チャットテキストエリアの下にコードインデックスのドットを常に表示する機能が追加された • リリースノートには新機能や修正点の詳細は記載されていない • リリースはGitHubの署名付きコミットとして確認されている

RooCodeInc/Roo-Code 2025/07/09

release tool

How Lush and Google Cloud AI are reinventing retail checkout

Cosmetics company Lush is embracing Google Cloud AI to improve how they work.

Google AI Blog 2025/07/09

api tool

Devin vs Cursor Background Agents: 完全自律型AIエージェントの性能比較

はじめに Cursor のBackground Agentsが GA になったので「Devinとどの程度たたかえるのか？」という疑問が湧いてきました。そこでTypeScriptのクイズ101問をすべて解くというタスクでDevinと戦ってもらいます。ここにスーパーサブのClaude Code Actionさんも参加してもらって三つ巴にします。チャンピオンを決めようや・・・お題はexercism/typescriptのリポジトリを筆者がエージェントタスク向けにフォークしたものを使います。Exercismはプログラミング学習サイトで、GitHubで公開している問題集とテストコードはAider PolyglotやRoo Codeなど実際のエージェント製品のベンチマークで使用されており、エージェント同士の比較に適しています。 GitHub - laiso/exercism-typescript: Exercism exercises in TypeScript.Exercism exercises in TypeScript. Contribute to laiso/exercism-t

Lai.so Blog 2025/07/09

api tool

Release v3.23.0

RooCodeのリリースv3.23.0では、コードベースのインデックス作成が実験的な状態から移行され、いくつかの新機能とバグ修正が行われた。新たにTODOリストツールが追加され、Gemini埋め込みプロバイダーがコードベースのインデックス作成に対応した。また、OpenAI互換プロバイダーでのフルエンドポイントURLのサポートや、マークダウンのサポートも追加された。設定内のAPIプロバイダー選択に検索/フィルタ機能が追加され、最大検索結果数を設定可能になった。その他、UIの一貫性やレイアウトの改善、タスクアクションにコピー・プロンプトボタンが追加されるなど、ユーザーエクスペリエンスの向上が図られた。 • コードベースのインデックス作成が実験的から正式に移行された • TODOリストツールが新たに追加された • Gemini埋め込みプロバイダーがコードベースのインデックス作成に対応 • OpenAI互換プロバイダーでフルエンドポイントURLをサポート • マークダウンのサポートが追加された • APIプロバイダー選択に検索/フィルタ機能が追加された • 最大検索結果数を設定可能になった • UIの一貫性やレイアウトの改善が行われた

RooCodeInc/Roo-Code 2025/07/09

release tool

slime: An SGLang-Native Post-Training Framework for RL Scaling

<h2><a id="vision-that-drives-slime" class="anchor" href="#vision-that-drives-slime" aria-hidden="true"><svg aria-hidden="true" class="octicon octicon-link" ...

LMSYS Blog 2025/07/09

framework tool

Mastra Changelog 2025-07-09

Mastra is now Apache-2.0 licensed, Playground goes multi-modal, new memory and RAG features, and more.

Mastra Blog 2025/07/09

ai api framework

Creating custom kernels for the AMD MI300

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog 2025/07/09

library tool

Upskill your LLMs with Gradio MCP Servers

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog 2025/07/09

api tool

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog 2025/07/09

library tool

langchain-ollama==0.3.4

この記事は、langchain-ollamaのバージョン0.3.4のリリースに関するもので、主にいくつかの修正と機能追加が行われたことを報告しています。具体的には、モデルの検証を修正し、呼び出しごとの推論設定が可能になったこと、ruffによるルールの追加と修正、ドキュメントの更新が含まれています。また、テストの更新や、エラーキャッチ機能の強化も行われています。これにより、langchain-ollamaの安定性と使いやすさが向上しています。 • langchain-ollamaのバージョン0.3.4がリリースされた • モデルの検証機能が修正された • 呼び出しごとの推論設定が可能になった • ruffによるルールの追加と修正が行われた • ドキュメントの更新が行われ、明確さが向上した • テストの更新が行われ、エラーキャッチ機能が強化された

langchain-ai/langchain 2025/07/08

release tool

Accelerate AI development with Amazon Bedrock API keys

Today, we’re excited to announce a significant improvement to the developer experience of Amazon Bedrock: API keys. API keys provide quick access to the Amazon Bedrock APIs, streamlining the authentication process so that developers can focus on building rather than configuration.

AWS Machine Learning Blog 2025/07/08

api cloud tool

cli==0.3.4

この記事は、GitHub上のlangchain-aiプロジェクトにおけるCLIツールのバージョン0.3.4のリリースに関する情報を提供しています。このリリースでは、いくつかの新機能や修正が行われました。具体的には、ビルド依存関係を保持するための引数が追加され、ドキュメントのファイルパスがより堅牢になるように更新されました。また、依存関係のアップグレードやロックファイルの更新も行われています。さらに、自己ホスト型プランのベータフラグが削除され、CLIのAPIの最小境界が引き上げられました。これらの変更により、CLIツールの使い勝手や安定性が向上しています。 • CLIツールのバージョン0.3.4がリリースされた。 • ビルド依存関係を保持するための引数が追加された。 • ドキュメントのファイルパスが更新され、例がより堅牢になった。 • 依存関係のアップグレードが行われた。 • 自己ホスト型プランのベータフラグが削除された。 • CLIのAPIの最小境界が引き上げられた。

langchain-ai/langgraph 2025/07/08

release tool

Affective Use of AI

We study Claude's performance on coding, reasoning, and knowledge tests. But we also research how people use Claude for emotional support and companionship. In this fireside chat, our team discusses our findings on how people are using Claude for emotional support, advice, and companionship. These insights directly inform our safeguards work—helping us build AI that's both helpful and safe.

YouTube Anthropic 2025/07/08

Accelerating data science innovation: How Bayer Crop Science used AWS AI/ML services to build their next-generation MLOps service

In this post, we show how Bayer Crop Science manages large-scale data science operations by training models for their data analytics needs and maintaining high-quality code documentation to support developers. Through these solutions, Bayer Crop Science projects up to a 70% reduction in developer onboarding time and up to a 30% improvement in developer productivity.

AWS Machine Learning Blog 2025/07/08

framework tool

Combat financial fraud with GraphRAG on Amazon Bedrock Knowledge Bases

In this post, we show how to use Amazon Bedrock Knowledge Bases GraphRAG with Amazon Neptune Analytics to build a financial fraud detection solution.

AWS Machine Learning Blog 2025/07/08

api tool

Classify call center conversations with Amazon Bedrock batch inference

In this post, we demonstrate how to build an end-to-end solution for text classification using the Amazon Bedrock batch inference capability with the Anthropic’s Claude Haiku model. We walk through classifying travel agency call center conversations into categories, showcasing how to generate synthetic training data, process large volumes of text data, and automate the entire workflow using AWS services.

AWS Machine Learning Blog 2025/07/08

api tool

Effective cross-lingual LLM evaluation with Amazon Bedrock

In this post, we demonstrate how to use the evaluation features of Amazon Bedrock to deliver reliable results across language barriers without the need for localized prompts or custom infrastructure. Through comprehensive testing and analysis, we share practical strategies to help reduce the cost and complexity of multilingual evaluation while maintaining high standards across global large language model (LLM) deployments.

AWS Machine Learning Blog 2025/07/08

api tool

Cohere Embed 4 multimodal embeddings model is now available on Amazon SageMaker JumpStart

The Cohere Embed 4 multimodal embeddings model is now generally available on Amazon SageMaker JumpStart. The Embed 4 model is built for multimodal business documents, has leading multilingual capabilities, and offers notable improvement over Embed 3 across key benchmarks. In this post, we discuss the benefits and capabilities of this new model. We also walk you through how to deploy and use the Embed 4 model using SageMaker JumpStart.

AWS Machine Learning Blog 2025/07/08

api tool

Meet the Builders: Highlights from the MCP Server Builder Meetup

Unlock microservices potential with Apollo GraphQL. Seamlessly integrate APIs, manage data, and enhance performance. Explore Apollo's innovative solutions.

apollo-blog 2025/07/08

api cloud tool

Working with 400,000 teachers to shape the future of AI in schools

OpenAI joins the American Federation of Teachers to launch the National Academy for AI Instruction.

OpenAI Blog 2025/07/08

tool

OpenAI 🤝 @teamganassi

Congrats on an amazing IndyCar weekend for Alex Palou! 📸: Larry Chen

YouTube OpenAI 2025/07/08

OME: Revolutionizing LLM Infrastructure with Model-Driven Architecture

<h2><a id="the-tale-of-two-teams-why-model-serving-is-broken" class="anchor" href="#the-tale-of-two-teams-why-model-serving-is-broken" aria-hidden="true"><sv...

LMSYS Blog 2025/07/08

cloud platform tool

SmolLM3: smol, multilingual, long-context reasoner

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog 2025/07/08

library tool

Cursorの価格設定変更の騒動について

2024年6月にCursorは価格体系を大幅に変更し、月額20ドルのProプランを「リクエスト数制限」から「トークン使用量制限」へと切り替え、さらに月額200ドルのUltraプランを新設しました。 Updates to Ultra and Pro | Cursor - The AI Code EditorIn collaboration with the model providers, we’re introducing a $200 / mo tier for power users.Cursor Cursorの説明によると、以前は月500リクエストまでの制限で、リクエストごとのトークン使用量は考慮されていませんでした。新しい料金モデルは1回のリクエストで消費するトークン数が大幅に異なるため、単純なリクエスト数制限ではコストを正確に反映できなくなりました。そのため、CursorはAPIベースのトークン使用量課金に移行し、Proプランには月20ドル分のトークンクレジットを含み、それを超えた分は追加課金となる形にしました。まずいことにCursorはこの変更をポジティブに伝えるた

Lai.so Blog 2025/07/07

tool

How INRIX accelerates transportation planning with Amazon Bedrock

INRIX pioneered the use of GPS data from connected vehicles for transportation intelligence. In this post, we partnered with Amazon Web Services (AWS) customer INRIX to demonstrate how Amazon Bedrock can be used to determine the best countermeasures for specific city locations using rich transportation data and how such countermeasures can be automatically visualized in street view images. This approach allows for significant planning acceleration compared to traditional approaches using conceptual drawings.

AWS Machine Learning Blog 2025/07/07

api cloud tool

Qwen3 family of reasoning models now available in Amazon Bedrock Marketplace and Amazon SageMaker JumpStart

Today, we are excited to announce that Qwen3, the latest generation of large language models (LLMs) in the Qwen family, is available through Amazon Bedrock Marketplace and Amazon SageMaker JumpStart. With this launch, you can deploy the Qwen3 models—available in 0.6B, 4B, 8B, and 32B parameter sizes—to build, experiment, and responsibly scale your generative AI applications on AWS. In this post, we demonstrate how to get started with Qwen3 on Amazon Bedrock Marketplace and SageMaker JumpStart.

AWS Machine Learning Blog 2025/07/07

tool

Agents as escalators: Real-time AI video monitoring with Amazon Bedrock Agents and video streams

In this post, we show how to build a fully deployable solution that processes video streams using OpenCV, Amazon Bedrock for contextual scene understanding and automated responses through Amazon Bedrock Agents. This solution extends the capabilities demonstrated in Automate chatbot for document and data retrieval using Amazon Bedrock Agents and Knowledge Bases, which discussed using Amazon Bedrock Agents for document and data retrieval. In this post, we apply Amazon Bedrock Agents to real-time video analysis and event monitoring.

AWS Machine Learning Blog 2025/07/07

api tool

langchain-mistralai==0.2.11

この記事は、langchain-mistralaiのバージョン0.2.11のリリースに関する情報を提供しています。このリリースでは、ruffに関連する問題を自動的に修正する機能が追加され、MistralAIのチャンクをAIMessageChunkに解析する際にfinish_reasonをレスポンスメタデータに含めるように改善されました。また、コメント内の誤字を修正し、互換性に関する注意点が改善されました。標準テストとしてベンチマークが追加され、langchainおよび関連ライブラリのPythonの上限が削除されました。コードはPython 3.9の標準に合わせて更新されました。 • langchain-mistralaiのバージョン0.2.11がリリースされた • ruffに関連する問題を自動的に修正する機能が追加された • MistralAIのチャンクをAIMessageChunkに解析する際にfinish_reasonをレスポンスメタデータに含めるように改善された • コメント内の誤字が修正され、互換性に関する注意点が改善された • 標準テストとしてベンチマークが追加された • langchainおよび関連ライブラリのPythonの上限が削除された • コードはPython 3.9の標準に合わせて更新された

langchain-ai/langchain 2025/07/07

api library release

Quoting Aphyr

I strongly suspect that Market Research Future, or a subcontractor, is conducting an automated spam campaign which uses a Large Language Model to evaluate a Mastodon instance, submit a plausible …

Simon Willison's Blog 2025/07/07

platform

New AI tools for mental health research and treatment

This field guide and investment support AI’s potential in evidence-based mental health interventions and research.

Google AI Blog 2025/07/07

tool

Become a command-line superhero with Simon Willison's llm tool

Christopher Smith ran a mini hackathon in Albany New York at the weekend around uses of my LLM - the first in-person event I'm aware of dedicated to that project! …

Simon Willison's Blog 2025/07/07

api tool

No Image

Enabling Fully Sharded Data Parallel (FSDP2) in Opacus

PyTorch Blog 2025/07/07

library tool

v0.17.2 Patch Release

DeepSpeedのv0.17.2パッチリリースでは、いくつかの重要な修正と改善が行われました。主な変更点には、Arctic Long Sequence Training (ALST)の名称変更、set_start_methodの破損を防ぐ修正、<glog/logging.h>のエラー修正、コンパイル用のパディングユーティリティの改善、404エラーの修正、チュートリアルタイトルの修正、再コンパイルのための実際の入力の復元、WarmupLRの最適化子lrの継承に関する修正、torch.autocastのZeROとの統合、F.interpolateのフロップスプロファイラーサポート、FP8ユニットテストの許容誤差の緩和、DeepCompileのZeROステージ1およびステージ2サポートの追加などが含まれています。これらの変更により、DeepSpeedの機能性と安定性が向上しました。 • Arctic Long Sequence Training (ALST)の名称変更が行われた。 • set_start_methodの破損を防ぐ修正が施された。 • <glog/logging.h>に関するエラーが修正された。 • コンパイル用のパディングユーティリティが改善された。 • 404エラーやチュートリアルタイトルの修正が行われた。 • torch.autocastとZeROの統合が有効化された。 • DeepCompileのZeROステージ1およびステージ2のサポートが追加された。

microsoft/DeepSpeed 2025/07/07

release tool

Introducing Deep Research in Azure AI Foundry Agent Service

Announcing the public preview of Deep Research in Azure AI Foundry—an API and SDK-based offering of OpenAI’s advanced agentic research capability. Learn more.

Microsoft AI Blog 2025/07/07

api cloud tool

The Best AI Coding Tools in 2025

Discover the best AI tools for coding in 2025 and transform how you build with these powerful coding assistants.

Builder.io Blog 2025/07/07

api library tool

Adding a feature because ChatGPT incorrectly thinks it exists

Adrian Holovaty describes how his SoundSlice service saw an uptick in users attempting to use their sheet music scanner to import ASCII-art guitar tab... because it turned out ChatGPT had …

Simon Willison's Blog 2025/07/07

api tool

917: AI Tools You Should Know

syntax-fm 2025/07/07

tool

I Shipped a macOS App Built Entirely by Claude Code

Indragie Karunaratne has "been building software for the Mac since 2008", but recently decided to try Claude Code to build a side project: Context, a native Mac app for debugging …

Simon Willison's Blog 2025/07/06

library tool

🥇Top AI Papers of the Week

The Top AI Papers of the Week (June 30 - July 6)

Elvis Saravia's NLP Blog 2025/07/06

platform

Quoting Nineteen Eighty-Four

There was a whole chain of separate departments dealing with proletarian literature, music, drama, and entertainment generally. Here were produced rubbishy newspapers containing almost nothing except sport, crime and astrology, …

Simon Willison's Blog 2025/07/06

platform

No Image

Supabase MCP can leak your entire SQL database

Here's yet another example of a lethal trifecta attack, where an LLM system combines access to private data, exposure to potentially malicious instructions and a mechanism to communicate data back …

Simon Willison's Blog 2025/07/06

database security

Context Engineering Guide

Prompt engineering is being rebranded as context engineering

Elvis Saravia's NLP Blog 2025/07/05

api tool

🤖 AI Agents Weekly: DeepSWE, Cursor 1.2, Evaluating Multi-Agent Systems, Prover Agent, Top AI Devs News

DeepSWE, Cursor 1.2, Evaluating Multi-Agent Systems, Prover Agent, Top AI Devs News

Elvis Saravia's NLP Blog 2025/07/05

library tool

No Image

Cursor: Clarifying Our Pricing

Cursor changed their pricing plan on June 16th, introducing a new $200/month Ultra plan with "20x more usage than Pro" and switching their $20/month Pro plan from "request limits to …

Simon Willison's Blog 2025/07/05

api tool

No Image

Identify, solve, verify

The more time I spend using LLMs for code, the less I worry for my career - even as their coding capabilities continue to improve. Using LLMs as part of …

Simon Willison's Blog 2025/07/04

platform

No Image

awwaiid/gremllm

Delightfully cursed Python library by Brock Wilcox, built on top of LLM: from gremllm import Gremllm counter = Gremllm("counter") counter.value = 5 counter.increment() print(counter.value) # 6? print(counter.to_roman_numerals()) # VI? You …

Simon Willison's Blog 2025/07/04

library tool

How to build a web-based AI agent with Stagehand and Gemini

Learn how to build a browser-based AI agent with Stagehand and Gemini to automate tasks like navigation, extraction, and interaction using natural language.

logrocket-dev 2025/07/04

api tool

Patch Release v4.53.1

この記事は、Hugging FaceのTransformersライブラリのバージョン4.53.1のパッチリリースについて説明しています。このリリースには、いくつかのバグ修正が含まれています。具体的には、tpプラグインの保護されていないインポートの修正、VLMのキー割り当ての修正、Gemma3nに関する複数の修正、FA2推論の修正、ビデオ推論の修正、マルチモーダルプロセッサの初期化時に重複引数を受け取る問題の修正、オプティマイザの作成を遅延させる際にモデルのみを準備する修正、マスクを通じてflex/sdpa/eagerのためのパックされたテンソルフォーマットのサポート追加が含まれています。 • バージョン4.53.1のリリースには複数のバグ修正が含まれている • tpプラグインの保護されていないインポートの修正が行われた • VLMのキー割り当てが修正された • Gemma3nに関する複数の修正が含まれている • FA2推論とビデオ推論の修正が行われた • マルチモーダルプロセッサの初期化時の重複引数の問題が修正された • オプティマイザの作成を遅延させる際にモデルのみを準備する修正が行われた • flex/sdpa/eagerのためのパックされたテンソルフォーマットのサポートが追加された

huggingface/transformers 2025/07/04

release tool

@browserbasehq/[email protected]

この記事は、GitHub上で公開された@browserbasehq/stagehandのバージョン2.4.1のリリースノートを提供しています。このリリースには、いくつかのパッチ変更が含まれており、具体的には、デフォルトのダウンロード動作の設定、シャドウDOM内の要素に対する「not-supported」の返却、自動タブ閉鎖の無効化、スキーマなしの抽出オプションに対するデフォルトスキーマの設定、OSレベルのドロップダウンの処理改善、ターゲットワーカーや共有ワーカーへのフィルタリングの改善が含まれています。これらの変更は、ユーザーエクスペリエンスの向上を目的としています。 • デフォルトのダウンロード動作を設定 • シャドウDOM内の要素に対して「not-supported」を返す • 自動タブ閉鎖を無効化 • スキーマなしの抽出オプションにデフォルトスキーマを設定 • OSレベルのドロップダウンの処理を改善 • ターゲットワーカーや共有ワーカーへのフィルタリングを改善

browserbase/stagehand 2025/07/04

release tool

No Image

Quoting Adam Gordon Bell

I think that a lot of resistance to AI coding tools comes from the same place: fear of losing something that has defined you for so long. People are reacting …

Simon Willison's Blog 2025/07/03

platform

No Image

Frequently Asked Questions (And Answers) About AI Evals

Hamel Husain and Shreya Shankar have been running a paid, cohort-based course on AI Evals For Engineers & PMs over the past few months. Here Hamel collects answers to the …

Simon Willison's Blog 2025/07/03

platform

No Image

Trial Court Decides Case Based On AI-Hallucinated Caselaw

Joe Patrice writing for Above the Law: [...] it was always only a matter of time before a poor litigant representing themselves fails to know enough to sniff out and …

Simon Willison's Blog 2025/07/03

platform

langchain-anthropic==0.3.17

この記事は、GitHub上でのlangchain-anthropicのバージョン0.3.17のリリースに関する情報を提供しています。このリリースは2023年7月3日に行われ、主な変更点として、テストの一時的なスキップ、リリースの整形、ドキュメントのクリーンアップ、ruff banditルールの追加が含まれています。これにより、開発者は新しい機能や修正を利用できるようになります。 • langchain-anthropicのバージョン0.3.17がリリースされた • テストが一時的にスキップされた • ドキュメントの整形が行われた • ruff banditルールが追加された • リリース日は2023年7月3日である

langchain-ai/langchain 2025/07/03

release tool

langchain-core==0.3.68

この記事は、GitHub上でのlangchain-coreのバージョン0.3.68のリリースに関する情報を提供しています。このリリースは2023年7月3日に行われ、主な変更点として、OpenAIツールのテストにおいてパラメトリックテストが使用されるようになったこと、FileCallbackHandlerに対してコンテキストマネージャが使用されるようになったことが挙げられています。これにより、コードのテストやファイル処理の効率が向上することが期待されます。 • langchain-coreのバージョン0.3.68がリリースされた • リリース日は2023年7月3日 • OpenAIツールのテストにパラメトリックテストが導入された • FileCallbackHandlerにコンテキストマネージャが使用されるようになった • これによりテストやファイル処理の効率が向上することが期待される

langchain-ai/langchain 2025/07/03

release tool

Create with Claude by describing what you want to make

With artifacts in Claude, you can turn your ideas into interactive experiences. No coding required—just describe what you want to create.

YouTube Anthropic 2025/07/03

No Image

Sandboxed tools in a loop

Something I've realized about LLM tool use is that it means that if you can reduce a problem to something that can be solved by an LLM in a sandbox …

Simon Willison's Blog 2025/07/03

tool

Transforming network operations with AI: How Swisscom built a network assistant using Amazon Bedrock

In this post, we explore how Swisscom developed their Network Assistant. We discuss the initial challenges and how they implemented a solution that delivers measurable benefits. We examine the technical architecture, discuss key learnings, and look at future enhancements that can further transform network operations.

AWS Machine Learning Blog 2025/07/03

api tool

End-to-End model training and deployment with Amazon SageMaker Unified Studio

In this post, we guide you through the stages of customizing large language models (LLMs) with SageMaker Unified Studio and SageMaker AI, covering the end-to-end process starting from data discovery to fine-tuning FMs with SageMaker AI distributed training, tracking metrics using MLflow, and then deploying models using SageMaker AI inference for real-time inference. We also discuss best practices to choose the right instance size and share some debugging best practices while working with JupyterLab notebooks in SageMaker Unified Studio.

AWS Machine Learning Blog 2025/07/03

api cloud tool

Getting started with Claude 4 API: A developer’s walkthrough

This guide explores how to use Anthropic's Claude 4 models, including Opus 4 and Sonnet 4, to build AI-powered applications.

logrocket-dev 2025/07/03

api tool

No Image

Table saws

Quitting programming as a career right now because of LLMs would be like quitting carpentry as a career thanks to the invention of the table saw.

Simon Willison's Blog 2025/07/03

platform

Beyond Workflows: Introducing Agent Network (vNext)

Agent Network (vNext) introduces intelligent AI orchestration that automatically routes and executes complex multi-agent tasks without predetermined workflows.

Mastra Blog 2025/07/03

ai api framework

Mastra Changelog 2025-07-03

Agent Network (vNext), workflow cancellation, and custom memory model support highlight this week's Mastra updates.

Mastra Blog 2025/07/03

ai api framework

Release v3.22.6

RooCodeIncのRoo-Codeのリリースv3.22.6では、いくつかの新機能とバグ修正が行われた。新機能には、フォローアップ質問のためのタイマーによる自動承認、インポート/エクスポートモードの機能、チャット画面における持続的なバージョンインジケーター、拡張機能の起動時に自動的に設定をインポートする機能、ユーザーが設定可能なセマンティック検索のスコア閾値スライダーが追加された。また、いくつかのバグ修正も行われ、AWS Bedrockのクロスリージョン推論プロファイルマッピングの修正や、APIのリトライの指数バックオフの上限を10分に設定する修正が含まれている。これにより、ユーザーはよりスムーズな体験を得ることができる。 • フォローアップ質問のためのタイマーによる自動承認機能の追加 • インポート/エクスポートモードの機能追加 • チャット画面に持続的なバージョンインジケーターを追加 • 拡張機能の起動時に自動的に設定をインポートする機能の追加 • ユーザーが設定可能なセマンティック検索のスコア閾値スライダーの追加 • AWS Bedrockのクロスリージョン推論プロファイルマッピングの修正 • APIのリトライの指数バックオフの上限を10分に設定する修正

RooCodeInc/Roo-Code 2025/07/02

release tool

0.5.1

この記事は、GitHub上のlangchain-ai/langgraphリポジトリのバージョン0.5.1のリリースに関する情報を提供しています。このリリースでは、いくつかの重要な変更が行われました。具体的には、非推奨のpydanticロジックの削除や、型付き辞書に対するスキーマ生成の動作修正が含まれています。また、内部ツールの依存関係のロックファイルの更新や、壊れたリンクの修正も行われました。さらに、langchain-coreから認識されたツールメッセージコンテンツブロックタイプのインポートが追加され、バージョン0.5.1がリリースされました。 • 非推奨のpydanticロジックを削除 • 型付き辞書に対するスキーマ生成の動作を修正 • 内部ツールの依存関係のロックファイルを更新 • 壊れたリンクを修正 • langchain-coreからツールメッセージコンテンツブロックタイプをインポート

langchain-ai/langgraph 2025/07/02

release tool

Optimize RAG in production environments using Amazon SageMaker JumpStart and Amazon OpenSearch Service

In this post, we show how to use Amazon OpenSearch Service as a vector store to build an efficient RAG application.

AWS Machine Learning Blog 2025/07/02

api library tool

ChatGPT almost wasn't named ChatGPT

Head of ChatGPT Nick Turley talks about the days leading up to the launch of ChatGPT on episode 2 of the OpenAI Podcast. Watch the full episode here: https://www.youtube.com/watch?v=atXyXP3yYZ4

YouTube OpenAI 2025/07/02

0.48.0 - 2025-07-02

この記事は、OpenHandsのバージョン0.48.0のリリースノートを提供しています。このリリースでは、ユーザーのディレクトリからマイクロエージェントを読み取る機能が追加され、.cursorrulesファイルのサポートも導入されました。また、会話を停止する機能や、イベントストリームでのsetup.shスクリプトの実行が可能になりました。CLIランタイムではJupyterプラグインがデフォルトで無効化され、--fileオプションがよりインタラクティブで使いやすくなりました。さらに、LLM設定の更新が既存の会話に適用されるようになり、CLIの終了メッセージも改善されました。 • ユーザーのディレクトリからマイクロエージェントを読み取る機能の追加 • 新たに.cursorrulesファイルのサポート • 会話を停止する機能の追加 • setup.shスクリプトの実行をイベントストリームで確認可能に • CLIランタイムでJupyterプラグインがデフォルトで無効化 • --fileオプションのインタラクティブ性向上 • LLM設定の更新が既存の会話に適用されるように • CLIの終了メッセージの改善

All-Hands-AI/OpenHands 2025/07/02

release tool

Advancing AI agent governance with Boomi and AWS: A unified approach to observability and compliance

In this post, we share how Boomi partnered with AWS to help enterprises accelerate and scale AI adoption with confidence using Agent Control Tower.

AWS Machine Learning Blog 2025/07/02

api cloud tool

No Image

Reducing Storage Footprint and Bandwidth Usage for Distributed Checkpoints with PyTorch DCP

PyTorch Blog 2025/07/02

library tool

2025-07-01

この記事は、mastra-aiのリリースノートに関するもので、2025年7月1日に行われた更新内容を詳述しています。主な変更点には、非同期タイトル生成プロセスの待機処理の改善、Playgroundのワークフロー処理の向上、エンドツーエンドテストの開始、UIのカスタムチャットスレッドタイトル表示の修正、メモリモジュールでのカスタム言語モデルの指定機能の追加などが含まれています。また、エラーメッセージの改善やデータベース接続オブジェクトの公開、ツールバンドルプロセスのCloudflare Workersとの互換性向上なども報告されています。これらの変更は、ユーザー体験の向上やシステムの安定性を目的としています。 • 非同期タイトル生成プロセスの待機処理を改善し、生成完了前にプロセスが終了することによる失敗を防止する。 • Playgroundのワークフロー処理を改善し、より効率的な動作を実現する。 • エンドツーエンドテストをPlaywrightを使用して開始し、テストの信頼性を向上させる。 • メモリモジュールでカスタム言語モデルを指定できるようにし、ユーザーがエージェントのデフォルトモデルを上書きできるようにする。 • エラーメッセージを明確にし、ユーザーが不明なメッセージを送信した際に具体的なアクションを提示する。

mastra-ai/mastra 2025/07/02

api release tool

The latest AI news we announced in June

Here are Google’s latest AI updates from June 2025

Google AI Blog 2025/07/02

tool

Context Engineering

TL;DR Agents need context to perform tasks. Context engineering is the art and science of filling the context window with just the right information at each step of an agent’s trajectory. In this post, we break down some common strategies — write, select, compress, and isolate — for context engineering

LangChain Blog 2025/07/02

api tool

No Image

Quoting Charles Babbage

On two occasions I have been asked, — "Pray, Mr. Babbage, if you put into the machine wrong figures, will the right answers come out ?" In one case a …

Simon Willison's Blog 2025/07/02

platform

t-wada vs テスト大好郎

先日一部のClaude Codeユーザーの間で「プロンプトに”t-wadaさんの推奨する進め方に従ってください”と書くとテスト駆動開発のプラクティスを実践してくれる」というTIPSが話題になっていました。なるほど、TDDやテスト駆動開発という言葉は広まりすぎて「意味の希薄化」が発生し、曖昧な理解のまま自動テストやテストファーストと混同され、それがLLMの学習データにも影響したが、人名を与えるとLLMに「具体的な参照点」を与え、より具体的なプログラミングスタイルに限定させる効果があったのか pic.twitter.com/p6SCPj8YdA — Takuto Wada (@t_wada) June 25, 2025 これは確かに面白い現象で、現にClaudeに直接質問するとt-wadaさんの知識を持っていることがわかります。そこから連想してClaude CodeがTDDをするトリガーとして使えるのなら面白いなと思い色々試してみました。（ところでこの翌日、最近バイブコーディングにはまってSmalltalkのライブラリをLLMで書いているKent Beckも自著のタイトルを

Lai.so Blog 2025/07/02

api testing tool

AI dev tool power rankings & comparison [July 2025 edition]

Which AI frontend dev tool reigns supreme in July 2025? Check out our power rankings and use our interactive comparison tool to find out.

logrocket-dev 2025/07/02

tool

AI Agentが回答に困った時にSlackで人間に助言を求められるMCPを検証した

AI ShiftのTECH BLOGです。AI技術の情報や活用方法などをご案内いたします。

AI-Shift Tech Blog 2025/07/02

api tool

cli-1.1.4

この記事は、GitHub上でのchroma-coreプロジェクトのCLIバージョン1.1.4のリリースに関する情報を提供しています。このリリースは2023年7月2日に行われ、GitHubの署名付きコミットとして記録されています。リリースには6つのアセットが含まれていることが示されていますが、詳細な内容や変更点については記載されていません。ユーザーは、リリースノートを通じて新機能や修正点を確認することが期待されます。 • CLIバージョン1.1.4が2023年7月2日にリリースされた • リリースはGitHubの署名付きコミットとして記録されている • リリースには6つのアセットが含まれている • 具体的な変更点や新機能については記載がない

chroma-core/chroma 2025/07/02

release tool

Mandelbrot in x86 assembly by Claude

Inspired by a tweet asking if Claude knew x86 assembly, I decided to run a bit of an experiment. I prompted Claude Sonnet 4: Write me an ascii art mandelbrot …

Simon Willison's Blog 2025/07/02

library tool

No Image

TIL: Using Playwright MCP with Claude Code

Inspired by Armin ("I personally use only one MCP - I only use Playwright") I decided to figure out how to use the official Playwright MCP server with Claude Code. …

Simon Willison's Blog 2025/07/01

api tool

No Image

Quoting Kevin Webb

One of the best examples of LLM developer tooling I've heard is from a team that supports software from the 80s-90s. Their only source of documentation is video interviews with …

Simon Willison's Blog 2025/07/01

tool

Use Amazon SageMaker Unified Studio to build complex AI workflows using Amazon Bedrock Flows

In this post, we demonstrate how you can use SageMaker Unified Studio to create complex AI workflows using Amazon Bedrock Flows.

AWS Machine Learning Blog 2025/07/01

tool

No Image

A custom template system from the mid-2000s era

Using LLMs for code archaeology is pretty fun. I stumbled across this blog entry from 2003 today, in which I had gotten briefly excited about ColdFusion and implemented an experimental …

Simon Willison's Blog 2025/07/01

library tool

Accelerating AI innovation: Scale MCP servers for enterprise workloads with Amazon Bedrock

In this post, we present a centralized Model Context Protocol (MCP) server implementation using Amazon Bedrock that provides shared access to tools and resources for enterprise AI workloads. The solution enables organizations to accelerate AI innovation by standardizing access to resources and tools through MCP, while maintaining security and governance through a centralized approach.

AWS Machine Learning Blog 2025/07/01

api tool

langchain-ollama==0.3.5

Accenture scales video analysis with Amazon Nova and Amazon Bedrock Agents

Voxtral

common-pile/caselaw_access_project

Deploy conversational agents with Vonage and Amazon Nova Sonic

LangSmith and LangGraph Platform are now available in AWS Marketplace

2025-07-15

More advanced AI capabilities are coming to Search

Open Deep Research

Enabling customers to deliver production-ready AI agents at scale

How to build unified AI interfaces using the Vercel AI SDK

Here's how to make these in ChatGPT

0.49.0 - 2025-07-16

ModernBERT Decoder (based on v4.53.2)

checkpointpostgres==2.0.23

Google France hosted a hackathon to tackle healthcare's biggest challenges

How to support new VLMs into SGLang: A Case Study with NVILA

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Amazon Bedrock Knowledge Bases now supports Amazon OpenSearch Service Managed Cluster as vector store

Monitor agents built on Amazon Bedrock with Datadog LLM Observability

How PayU built a secure enterprise AI assistant using Amazon Bedrock

Claude Connectors

langchain-core==0.3.69

Reflections on OpenAI

What's an AI Agent?

Brad Lightcap and Ronnie Chatterji on jobs, growth, and the AI economy — the OpenAI Podcast Ep. 3

The next wave of AI for content creation includes digital twins

xAI: "We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated"

Supercharge generative AI workflows with NVIDIA DGX Cloud on AWS and Amazon Bedrock Custom Model Import

Accelerate generative AI inference with NVIDIA Dynamo and Amazon EKS

Moonshot AI&apos;s Kimi K2 model is now supported in Vercel AI Gateway

AWS doubles investment in AWS Generative AI Innovation Center, marking two years of customer success

AI compliance: A core product competency you shouldn’t skip

Release v3.23.12

AWS の エージェント IDE Kiro を使ってみた

A summer of security: empowering cyber defenders with AI

Release v3.23.11

Intellectual freedom by design

Application development without programmers

Release v3.23.10

0.5.3

Release v3.23.9

Discover tools that work with Claude

ccusage

Build AI-driven policy creation for vehicle data collection and automation using Amazon Bedrock

How Rapid7 automates vulnerability risk scores with ML pipelines using Amazon SageMaker AI

Build secure RAG applications with AWS serverless data lakes

langchain-openai==0.3.28

Cost Effective Deployment of DeepSeek R1 with Intel® Xeon® 6 CPU on SGLang

Release v3.23.8

🥇Top AI Papers of the Week

サンドボックス環境を MCP サーバーで提供する Container Use

🤖 AI Agents Weekly: Grok 4, Context Engineering Guide, Kimi K2, SmolLM3, MedGemma 27B, AI SDK 5

Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity

Grok 4 Heavy won't reveal its system prompt

Quoting @grok

Release v3.23.7

Musk’s latest Grok chatbot searches for billionaire mogul’s views before answering questions

moonshotai/Kimi-K2-Instruct

Advanced fine-tuning methods on Amazon SageMaker AI

Streamline machine learning workflows with SkyPilot on Amazon SageMaker HyperPod

Quoting Django’s security policies

Intelligent document processing at scale with generative AI and Amazon Bedrock Data Automation

Build a conversational data assistant, Part 2 – Embedding generative business intelligence with Amazon Q in QuickSight

Build a conversational data assistant, Part 1: Text-to-SQL with Amazon Bedrock Agents

Implement user-level access control for multi-tenant ML platforms on Amazon SageMaker AI

Long-running execution flows now supported in Amazon Bedrock Flows in public preview

Fraud detection empowered by federated learning with the Flower framework on Amazon SageMaker AI

Building intelligent AI voice agents with Pipecat and Amazon Bedrock – Part 2

Uphold ethical standards in fashion using multimodal toxicity detection with Amazon Bedrock Guardrails

langchain-groq==0.3.6

Patch Release v4.53.2

GitHub Copilot NESの内部実装が公開、そして続・AIエディタ戦争

The EU Code of Practice and future of AI in Europe

Generationship: Ep. #39, Simon Willison

Grok: searching X for "from:elonmusk (Israel OR Palestine OR Hamas OR Gaza)"

checkpointpostgres==2.0.22

Grok 4

New capabilities in Amazon SageMaker AI continue to transform how organizations develop AI models

Accelerate foundation model development with one-click observability in Amazon SageMaker HyperPod

Moonshot AI's Kimi K2 model is now supported in Vercel AI Gateway

AWS のエージェント IDE Kiro を使ってみた