hiroppy's site

RSS Feeds:

Last updated: 2026/01/10 15:01

Release v3.39.2

RooCodeIncのRoo-Codeのリリースv3.39.2では、いくつかのバグ修正と機能改善が行われた。具体的には、Cerebrasとの互換性を確保するためにツールの厳格モード値を一貫させる修正や、OpenAI互換プロバイダーのためにconvertToSimpleMessagesを削除する修正が含まれている。また、Geminiとの互換性を保つためにアシスタントメッセージの内容が未定義にならないようにする修正も行われた。新機能としては、プロバイダーからのストリーム終了エラーに対するエラーメッセージの改善や、トラブルシューティングを容易にするためのデバッグ設定の追加がある。CLIサポートのために、@roo-code/typesや@roo-code/coreに機能が追加され、CLI開発に役立つスラッシュコマンドも導入された。 • Cerebrasとの互換性を確保するためにツールの厳格モード値を一貫させる修正 • OpenAI互換プロバイダーのためにconvertToSimpleMessagesを削除 • Geminiとの互換性を保つためにアシスタントメッセージの内容が未定義にならないようにする修正 • プロバイダーからのストリーム終了エラーに対するエラーメッセージの改善 • トラブルシューティングを容易にするためのデバッグ設定の追加 • CLIサポートのために@roo-code/typesや@roo-code/coreに機能が追加 • CLI開発に役立つスラッシュコマンドの導入

RooCodeInc/Roo-Code2026-01-10

releasetool

Crossmodal search with Amazon Nova Multimodal Embeddings

In this post, we explore how Amazon Nova Multimodal Embeddings addresses the challenges of crossmodal search through a practical ecommerce use case. We examine the technical limitations of traditional approaches and demonstrate how Amazon Nova Multimodal Embeddings enables retrieval across text, images, and other modalities. You learn how to implement a crossmodal search system by generating embeddings, handling queries, and measuring performance. We provide working code examples and share how to add these capabilities to your applications.

AWS Machine Learning Blog2026-01-10

apicloudtool

stagehand/server v3.3.0

この記事は、GitHub上で公開されたstagehand/serverのバージョン3.3.0のリリースに関する情報を提供しています。このリリースでは、ハイブリッドモードのドキュメント更新、エージェントメッセージ処理の改善、ページのwaitForTimeout機能の追加、キャッシュが有効な場合のみXPathを計算するようにエージェントを更新、アクション後のスクリーンショット機能の追加、エージェントのロギングの改善などが行われました。また、Slackの参照をDiscordに置き換え、ツール関数と型のエクスポート、空のオブジェクトを強制するオプションパラメータの追加、AI SDKとのollamaサポートの修正、keyPressのControlまたはMetaキーの正規化の修正なども含まれています。これらの変更は、主にエージェントの機能性とユーザー体験の向上を目的としています。 • ハイブリッドモードのドキュメントが更新された • エージェントメッセージ処理が改善された • ページのwaitForTimeout機能が追加された • キャッシュが有効な場合のみXPathを計算するようにエージェントが更新された • アクション後のスクリーンショット機能が追加された • エージェントのロギングが改善された • Slackの参照がDiscordに置き換えられた • ツール関数と型がエクスポートされた • 空のオブジェクトを強制するオプションパラメータが追加された • AI SDKとのollamaサポートが修正された

browserbase/stagehand2026-01-09

releasetool

langgraph-sdk==0.3.2

この記事は、langgraph-sdkのバージョン0.3.2のリリースに関する情報を提供しています。このリリースでは、cron.on_run_completedのサポートが新たに追加され、ドキュメントが削除されるという変更が行われました。リリース日は2023年1月9日で、GitHub上でのコミットが確認されています。 • 新機能としてcron.on_run_completedのサポートが追加された • ドキュメントが削除された • リリース日は2023年1月9日 • バージョンは0.3.2である • GitHubでのコミットが確認されている

langchain-ai/langgraph2026-01-09

releasetool

Supercharging LLMs: Scalable RL with torchforge and Weaver

この記事では、MetaのPyTorchチームが開発したtorchforgeというPyTorchネイティブの強化学習（RL）ライブラリについて説明しています。torchforgeは、大規模な言語モデル（LLM）のポストトレーニングにおけるRLのスケーラビリティを向上させるために設計されており、512-GPUクラスターでの実験を通じてその効果が実証されました。特に、Weaverという検証システムと組み合わせることで、研究者は報酬設計やポリシー更新を迅速に行うことができ、インフラの複雑さを気にせずにRLアルゴリズムに集中できるようになります。torchforgeは、シングルノードからマルチノードクラスターまでスケール可能で、強化学習の実装を簡素化します。 • torchforgeは大規模なLLMのポストトレーニングにおけるRLのスケーラビリティを向上させるためのライブラリである。 • 512-GPUクラスターでの実験により、RLの実行が容易になった。 • Weaverは人間の注釈なしで生産レベルの報酬信号を提供する。 • Forgeは、インフラの複雑さを排除し、研究者がRLアルゴリズムに集中できるようにする。 • 強化学習の設計、ポリシー更新、検証戦略の反復が容易になる。

PyTorch Blog2026-01-09

librarytool

Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI

Quantized models can be seamlessly deployed on Amazon SageMaker AI using a few lines of code. In this post, we explore why quantization matters—how it enables lower-cost inference, supports deployment on resource-constrained hardware, and reduces both the financial and environmental impact of modern LLMs, while preserving most of their original performance. We also take a deep dive into the principles behind PTQ and demonstrate how to quantize the model of your choice and deploy it on Amazon SageMaker.

AWS Machine Learning Blog2026-01-09

cloudtool

langchain-core==1.2.7

この記事は、Langchainのコアライブラリのバージョン1.2.7のリリースに関する情報を提供しています。このリリースでは、いくつかのバグ修正と新機能が追加されました。具体的には、HTMLリンク抽出において無視するファイル拡張子が増え、メッセージの要約に関する機能が改善されました。また、LengthBasedExampleSelectorにおける空の例に対するテストが追加され、オプション引数を持つ関数の厳密なスキーマ生成が修正されました。さらに、カスタムメッセージセパレーターのサポートや、GPT-2トークナイザー使用時の警告が追加されました。これらの変更により、Langchainの機能性と安定性が向上しています。 • Langchainコアライブラリのバージョン1.2.7がリリースされた。 • HTMLリンク抽出で無視するファイル拡張子が追加された。 • LengthBasedExampleSelectorに空の例に対するテストが追加された。 • オプション引数を持つ関数の厳密なスキーマ生成が修正された。 • カスタムメッセージセパレーターのサポートが追加された。 • GPT-2トークナイザー使用時の警告が追加された。

langchain-ai/langchain2026-01-09

libraryrelease

How Beekeeper optimized user personalization with Amazon Bedrock

Beekeeper’s automated leaderboard approach and human feedback loop system for dynamic LLM and prompt pair selection addresses the key challenges organizations face in navigating the rapidly evolving landscape of language models.

AWS Machine Learning Blog2026-01-09

apicloudtool

Sentiment Analysis with Text and Audio Using AWS Generative AI Services: Approaches, Challenges, and Solutions

This post, developed through a strategic scientific partnership between AWS and the Instituto de Ciência e Tecnologia Itaú (ICTi), P&D hub maintained by Itaú Unibanco, the largest private bank in Latin America, explores the technical aspects of sentiment analysis for both text and audio. We present experiments comparing multiple machine learning (ML) models and services, discuss the trade-offs and pitfalls of each approach, and highlight how AWS services can be orchestrated to build robust, end-to-end solutions. We also offer insights into potential future directions, including more advanced prompt engineering for large language models (LLMs) and expanding the scope of audio-based analysis to capture emotional cues that text data alone might miss.

AWS Machine Learning Blog2026-01-09

frameworktool

Architecting TrueLook’s AI-powered construction safety system on Amazon SageMaker AI

This post provides a detailed architectural overview of how TrueLook built its AI-powered safety monitoring system using SageMaker AI, highlighting key technical decisions, pipeline design patterns, and MLOps best practices. You will gain valuable insights into designing scalable computer vision solutions on AWS, particularly around model training workflows, automated pipeline creation, and production deployment strategies for real-time inference.

AWS Machine Learning Blog2026-01-09

tool

This Year with ChatGPT

Set goals and stick to them all year with ChatGPT.

YouTube OpenAI2026-01-09

OpenAI and SoftBank Group partner with SB Energy

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特に自然言語処理を用いた機能が強化されています。具体的には、開発者が自然言語で指示を出すと、AIがそれに基づいてコードを生成することが可能です。また、ツールは既存の開発環境に簡単に統合できるよう設計されており、ユーザーは特別な設定を行うことなく利用を開始できます。これにより、開発の効率が大幅に向上し、エラーの削減にも寄与します。さらに、AIの学習能力により、使用するほどに精度が向上する点も特徴です。 • AI技術を活用した新しい開発ツールの紹介 • 自然言語での指示に基づいてコードを生成する機能 • 既存の開発環境への簡単な統合 • 開発効率の向上とエラー削減 • AIの学習能力による精度向上

OpenAI Blog2026-01-09

tool

Warp Specialization in Triton: Design and Roadmap

Tritonコンパイラは、AIカーネル向けにパフォーマンスポータブルなコードとランタイムを生成することを目指しています。Triton開発者コミュニティは、オペレーターのスケジューリング、メモリ割り当て、レイアウト管理の改善に取り組んでおり、特にカーネルの最適化が複雑化する中で、SOTAパフォーマンスを維持するのが難しくなっています。ワープ専門化は、GPU上でのカーネルパフォーマンスを向上させるための技術で、各ワープに特化したコードパスを持つことで、制御フローの分岐によるパフォーマンス低下を減少させ、レイテンシの隠蔽を改善します。autoWSは、OSS Tritonの上に構築されており、手動、TorchInductor、Helion生成のカーネルに対して有効化できます。現在の実装は、HopperおよびBlackwellアクセラレータをサポートしており、複雑なカーネルの最適化を支援します。今後の計画についても言及されており、Triton開発者コミュニティからのフィードバックを求めています。 • TritonコンパイラはAIカーネル向けにパフォーマンスポータブルなコードを生成することを目指している。 • ワープ専門化はGPU上でのカーネルパフォーマンスを向上させる技術である。 • autoWSはOSS Tritonの上に構築され、手動、TorchInductor、Helion生成のカーネルに対応している。 • ワープ専門化により、制御フローの分岐によるパフォーマンス低下を減少させ、レイテンシの隠蔽を改善する。 • 現在の実装はHopperおよびBlackwellアクセラレータをサポートしている。

PyTorch Blog2026-01-09

librarytool

Datadog uses Codex for system-level code review

OpenAI Blog2026-01-09

tool

Release v3.39.1

RooCodeIncのRoo-Codeリポジトリでのリリースv3.39.1では、いくつかの重要な修正が行われた。具体的には、ネイティブツール呼び出しのストリーミング中にファイルパスの安定性を確保するための修正、Geminiの思考署名の持続性を無効にして署名エラーを防ぐ修正、Anthropic APIとの互換性を確保するためにminItemsの値を2から1に変更する修正が含まれている。これらの修正は、リリース日である2026年1月8日に行われた。 • ネイティブツール呼び出しのストリーミング中にファイルパスの安定性を確保する修正 • Geminiの思考署名の持続性を無効にして署名エラーを防ぐ修正 • Anthropic APIとの互換性を確保するためにminItemsの値を変更する修正

RooCodeInc/Roo-Code2026-01-08

releasetool

Insecure Agents Podcast: Certified Patches, Supply Chain Security, and AI Agents

Socket CEO Feross Aboukhadijeh joins Insecure Agents to discuss CVE remediation and why supply chain attacks require a different security approach.

Socket2026-01-08

apisecuritytool

langchain==1.2.3

この記事は、LangChainのバージョン1.2.3のリリースに関する情報を提供しています。このリリースでは、いくつかの重要な変更が行われました。具体的には、使用状況メタデータに基づいて要約機能が強化され、ツール呼び出しとAIメッセージのペアリングを保持するように修正されました。また、チャットモデルプロバイダーの推論をカバーするテストが追加され、Azure OpenAI埋め込みプロバイダーのマップにおけるコピー＆ペーストエラーが修正されました。これらの変更により、LangChainの機能が向上し、ユーザーにとっての利便性が増しています。 • LangChainのバージョン1.2.3がリリースされた。 • 要約機能が使用状況メタデータに基づいて強化された。 • ツール呼び出しとAIメッセージのペアリングを保持するように修正された。 • チャットモデルプロバイダーの推論をカバーするテストが追加された。 • Azure OpenAI埋め込みプロバイダーのマップにおけるコピー＆ペーストエラーが修正された。

langchain-ai/langchain2026-01-08

releasetool

PyTorch 2.9: FlexAttention Optimization Practice on Intel GPUs

PyTorch 2.9では、Intel GPU上でのFlexAttention最適化が紹介されています。最新のLLMフレームワークは、Grouped Query AttentionやMulti-Query Attentionなどの注意メカニズムを採用しており、これにより精度とパフォーマンスのバランスが取られています。FlexAttentionは、ユーザー定義のscore_modとmask_modを受け入れ、torch.compileを使用して効率的なFlashAttentionカーネルを自動生成します。FlexAttentionは、HuggingFaceやvLLMなどのプロジェクトで広く採用されており、最新のLLMモデルへの迅速な適応を可能にします。Intel GPU上でのFlexAttentionは、PyTorchの標準GPU動作に合わせており、異なるGPU間での一貫したパフォーマンスを提供します。Triton XPUを使用することで、Intel GPU上でのTritonカーネルの実行が可能になり、FlexAttentionの最適化が実現されています。 • 最新のLLMフレームワークは注意メカニズムを採用し、精度とパフォーマンスのバランスを取る。 • FlexAttentionはユーザー定義のscore_modとmask_modを使用し、効率的なFlashAttentionカーネルを自動生成する。 • FlexAttentionはHuggingFaceやvLLMなどで広く採用され、最新のLLMモデルへの迅速な適応を可能にする。 • Intel GPU上でのFlexAttentionはPyTorchの標準GPU動作に合わせており、一貫したパフォーマンスを提供する。 • Triton XPUを使用することで、Intel GPU上でのTritonカーネルの実行が可能になる。

PyTorch Blog2026-01-08

librarytool

LLM predictions for 2026, shared with Oxide and Friends

I joined a recording of the Oxide and Friends podcast on Tuesday to talk about 1, 3 and 6 year predictions for the tech industry. This is my second appearance …

Simon Willison's Blog2026-01-08

apicloudtool

Scaling medical content review at Flo Health using Amazon Bedrock (Part 1)

This two-part series explores Flo Health's journey with generative AI for medical content verification. Part 1 examines our proof of concept (PoC), including the initial solution, capabilities, and early results. Part 2 covers focusing on scaling challenges and real-world implementation. Each article stands alone while collectively showing how AI transforms medical content management at scale.

AWS Machine Learning Blog2026-01-08

apitool

Preparing for Appointments | with ChatGPT

Liz used ChatGPT throughout her teenage son’s cancer treatment to translate reports, prepare questions, and have more informed conversations with doctors.

YouTube OpenAI2026-01-08

Detect and redact personally identifiable information using Amazon Bedrock Data Automation and Guardrails

This post shows an automated PII detection and redaction solution using Amazon Bedrock Data Automation and Amazon Bedrock Guardrails through a use case of processing text and image content in high volumes of incoming emails and attachments. The solution features a complete email processing workflow with a React-based user interface for authorized personnel to more securely manage and review redacted email communications and attachments. We walk through the step-by-step solution implementation procedures used to deploy this solution. Finally, we discuss the solution benefits, including operational efficiency, scalability, security and compliance, and adaptability.

AWS Machine Learning Blog2026-01-08

apitool

Speed meets scale: Load testing SageMakerAI endpoints with Observe.AI’s testing tool

Observe.ai developed the One Load Audit Framework (OLAF), which integrates with SageMaker to identify bottlenecks and performance issues in ML services, offering latency and throughput measurements under both static and dynamic data loads. In this blog post, you will learn how to use the OLAF utility to test and validate your SageMaker endpoint.

AWS Machine Learning Blog2026-01-08

apicloudtool

AI's limited self-knowledge

Anthropic researcher Amanda Askell discusses the self-knowledge problem that AI models face.

YouTube Anthropic2026-01-08

How Google Got Its Groove Back and Edged Ahead of OpenAI

I picked up a few interesting tidbits from this Wall Street Journal piece on Google's recent hard won success with Gemini. Here's the origin of the name "Nano Banana": Naina …

Simon Willison's Blog2026-01-08

platform

Release v3.39.0

RooCodeIncのRoo-Codeリポジトリで、バージョン3.39.0がリリースされました。このリリースは2023年1月8日に行われ、GitHubでのコミットはGPG署名によって確認されています。リリースに関する詳細な情報は提供されていませんが、リリースノートには新機能や修正点が含まれている可能性があります。ユーザーはGitHub上でこのリリースを確認し、必要に応じてアセットをダウンロードすることができます。 • Roo-Codeのバージョン3.39.0がリリースされた • リリース日は2023年1月8日 • コミットはGitHubのGPG署名で確認済み • リリースノートには新機能や修正点が含まれる可能性がある • ユーザーはGitHubでリリースを確認できる

RooCodeInc/Roo-Code2026-01-08

releasetool

Netomi’s lessons for scaling agentic systems into the enterprise

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特に自然言語処理を用いた機能が強化されています。具体的には、開発者が自然言語で指示を出すと、AIがそれに基づいてコードを生成することが可能です。また、ツールは既存の開発環境に簡単に統合できるよう設計されており、ユーザーは特別な設定を行うことなくすぐに利用を開始できます。これにより、開発の効率が大幅に向上し、エラーの削減にも寄与します。さらに、AIの学習能力により、使用するほどに精度が向上する点も特徴です。 • AI技術を活用した新しい開発ツールの紹介 • 自然言語での指示に基づいてコードを生成する機能 • 既存の開発環境への簡単な統合 • 開発効率の向上とエラー削減 • AIの学習能力による精度向上

OpenAI Blog2026-01-08

tool

15 best n8n practices for deploying AI agents in production

This guide walks you through the 15 best n8n practices for deploying production-ready AI Agents. Choose the best infrastructure, scale queue mode, handle errors, monitor, and deploy AI Agents reliably in n8n.

n8n Blog2026-01-08

apicloudtool

OpenAI for Healthcare

OpenAI Blog2026-01-08

tool

v0.18.4 Patch Release

DeepSpeedのv0.18.4パッチリリースでは、いくつかの重要な修正と機能改善が行われました。主な変更点には、コンパイルテストでの決定論的オプションの無効化、SuperOffloadOptimizer_Stage3のクラッシュ修正、AMDサポートの改善、DeepSpeed Async I/Oの待機中のハング修正、PyTorch 2.8/2.9との互換性のためのDeepCompileの修正などが含まれています。また、Python 3.11および3.12のテストが有効化され、AWS上でのCIワークフローが追加されました。これにより、DeepSpeedの信頼性とパフォーマンスが向上し、ユーザーにとっての利便性が増しています。 • v0.18.4パッチリリースでの主な修正と改善が行われた • 決定論的オプションを無効化し、SuperOffloadOptimizer_Stage3のクラッシュを修正 • AMDサポートの改善が実施された • DeepSpeed Async I/Oの待機中のハングを修正し、信頼性を向上させた • PyTorch 2.8/2.9との互換性を確保するための修正が行われた • Python 3.11および3.12のテストが追加された • AWS上でのCIワークフローが新たに導入された

microsoft/DeepSpeed2026-01-07

releasetool

langchain==1.2.2

この記事は、Langchainのバージョン1.2.2のリリースに関する情報を提供しています。このリリースでは、いくつかの重要な修正が行われました。具体的には、バージョンを検証するためのテストが追加され、計画ミドルウェアにおけるtodoツールの並行使用に関する問題が修正されました。また、モデル呼び出しのテストラップにおける型の修正も行われています。これにより、Langchainの安定性と機能性が向上しています。 • Langchainのバージョン1.2.2がリリースされた。 • バージョンを検証するためのテストが追加された。 • 計画ミドルウェアにおけるtodoツールの並行使用に関する問題が修正された。 • モデル呼び出しのテストラップにおける型の修正が行われた。 • これにより、Langchainの安定性と機能性が向上した。

langchain-ai/langchain2026-01-07

releasetool

Infosys partners with Cognition to expand engineering capacity and help scale its enterprise business

Infosys, a global leader in digital services and consulting, has partnered with Cognition to deploy Devin, the AI software engineer, across its organization and global client base.

Cognition AI Blog2026-01-07

aiplatformtool

Securely connecting your information and apps with ChatGPT Health

We’re introducing ChatGPT Health, a dedicated experience that securely brings your health information and ChatGPT’s intelligence together, to help you feel more informed, prepared, and confident navigating your health. When you choose to connect your health data, such as medical records or wellness apps, your responses are grounded in your own health information. You can also connect your Apple Health information and other wellness apps, such as Function, MyFitnessPal, Peloton. Apps may only be connected to your health data with your explicit permission, even if they’re already connected to ChatGPT for conversations outside of Health. And you’re always in control: disconnect an app at any time and it immediately loses access.

YouTube OpenAI2026-01-07

Personalized nutrition tips with ChatGPT

We’re introducing ChatGPT Health, a dedicated experience that securely brings your health information and ChatGPT’s intelligence together, to help you feel more informed, prepared, and confident navigating your health. Health conversations feel just like chatting with ChatGPT—but grounded in the information you’ve connected. You can upload photos and files and use search, deep research, voice mode and dictation. When relevant, ChatGPT can automatically reference your connected information to provide more relevant and personalized responses. For example, you might ask: “How’s my cholesterol trending?” or “Can you summarize my latest bloodwork before my appointment?”

YouTube OpenAI2026-01-07

Helping you choose the right insurance plan for you with ChatGPT

We’re introducing ChatGPT Health, a dedicated experience that securely brings your health information and ChatGPT’s intelligence together, to help you feel more informed, prepared, and confident navigating your health. With Health, ChatGPT can help you understand recent test results, prepare for appointments with your doctor, get advice on how to approach your diet and workout routine, or understand the tradeoffs of different insurance options based on your healthcare patterns. Join the waitlist: https://chatgpt.com/health/waitlist

YouTube OpenAI2026-01-07

Preparing for a doctor’s appointment with ChatGPT

We’re introducing ChatGPT Health, a dedicated experience that securely brings your health information and ChatGPT’s intelligence together, to help you feel more informed, prepared, and confident navigating your health. Health is designed to support, not replace, medical care. It is not intended for diagnosis or treatment. Instead, it helps you navigate everyday questions and understand patterns over time—not just moments of illness—so you can feel more informed and prepared for important medical conversations. Join the waitlist: https://chatgpt.com/health/waitlist

YouTube OpenAI2026-01-07

Quoting Adam Wathan

[...] the reality is that 75% of the people on our engineering team lost their jobs here yesterday because of the brutal impact AI has had on our business. And …

Simon Willison's Blog2026-01-07

apitool

Understanding your Scan Results | with ChatGPT

Burt uses ChatGPT to navigate life with two forms of cancer, supporting him in understanding scans, preparing for appointments, and explaining complex medical information to family.

YouTube OpenAI2026-01-07

Best AI Coding Tools for Developers in 2026

The best AI coding tools for developers in 2026. From IDEs to code review, find tools that work in real codebases without breaking your workflow.

Builder.io Blog2026-01-07

toolui

How we made v0 an effective coding agent

v0’s composite AI pipeline boosts reliability by fixing errors in real time. Learn how dynamic system prompts, LLM Suspense, and autofixers work together to deliver stable, working web app generations at scale.

Vercel Blog2026-01-07

apitool

How Tolan builds voice-first AI with GPT-5.1

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特にエラーの検出やコードの最適化に役立ちます。具体的には、AIがリアルタイムでコードを分析し、改善点を提案する機能が搭載されています。また、ユーザーインターフェースは直感的で使いやすく、導入も簡単です。さらに、他の開発環境との互換性も考慮されており、幅広いプラットフォームで利用可能です。これにより、開発者は生産性を向上させることが期待されます。 • AI技術を活用した新しい開発ツールの紹介 • リアルタイムでコードを分析し、改善点を提案する機能 • 直感的で使いやすいユーザーインターフェース • 簡単な導入プロセス • 幅広いプラットフォームとの互換性 • 生産性向上が期待される

OpenAI Blog2026-01-07

tool

3行で始める文章検索 ― txtai入門

AI ShiftのTECH BLOGです。AI技術の情報や活用方法などをご案内いたします。

AI-Shift Tech Blog2026-01-07

apilibrarytool

Quoting Robin Sloan

AGI is here! When exactly it arrived, we’ll never know; whether it was one company’s Pro or another company’s Pro Max (Eddie Bauer Edition) that tip-toed first across the line …

Simon Willison's Blog2026-01-07

platform

Understanding Inflammation | with ChatGPT

Living with heart failure, Steve uses ChatGPT to carry out his doctor’s care plan by tracking his diet, medications, and inflammation.

YouTube OpenAI2026-01-07

langchain==1.2.1

この記事は、Langchainのバージョン1.2.1のリリースに関する情報を提供しています。このリリースでは、メッセージ要約のためにget_buffer_stringを使用する修正や、テストモデルの型修正、PII（個人識別情報）に関するテストの型修正、ツールスキーマからの注入引数の除外など、さまざまな修正が行われました。また、ShellSession.execute()におけるレースコンディションの解決や、Googleの生成AIプロバイダーへのサポート追加などの新機能も含まれています。さらに、ドキュメントの改善やCIチェックの追加も行われています。 • Langchainのバージョン1.2.1がリリースされた。 • メッセージ要約のためにget_buffer_stringを使用する修正が行われた。 • テストモデルやPIIに関する型修正が実施された。 • ShellSession.execute()のレースコンディションが解決された。 • Googleの生成AIプロバイダーへのサポートが追加された。

langchain-ai/langchain2026-01-07

releasetool

Introducing ChatGPT Health

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特に自然言語処理を用いた機能が強化されています。具体的には、開発者が自然言語で指示を出すと、AIがそれに基づいてコードを生成することが可能です。また、ツールは既存の開発環境に統合できるため、導入が容易である点も強調されています。さらに、AIによるコード生成は、開発の効率を大幅に向上させることが期待されています。 • AI技術を活用した新しい開発ツールの紹介 • 自然言語での指示に基づいてコードを生成する機能 • 既存の開発環境への統合が容易 • 開発効率の向上が期待される

OpenAI Blog2026-01-07

tool

A field guide to sandboxes for AI

This guide to the current sandboxing landscape by Luis Cardoso is comprehensive, dense and absolutely fantastic. He starts by differentiating between containers (which share the host kernel), microVMs (their own …

Simon Willison's Blog2026-01-06

tool

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

A Blog post by NVIDIA on Hugging Face

Hugging Face Blog2026-01-06

apicloudtool

Balancing Motherhood | with ChatGPT

As a mom of two toddlers, Lauren uses ChatGPT to find time for herself through flexible workouts that fit into the unpredictable rhythms of her busy life.

YouTube OpenAI2026-01-06

Don’t ship another chat UI. Build real AI with AG-UI

Stop shipping chat UIs. Learn how AG-UI uses an event-driven protocol to build real AI apps with streaming, tools, and shared state.

logrocket-dev2026-01-06

librarytool

Commonwealth Bank of Australia builds AI fluency at scale

By rolling out ChatGPT Enterprise across its workforce, CBA is improving how teams work and deliver better outcomes for customers. Hear more from CEO Matt Comyn in this short video.

YouTube OpenAI2026-01-06

BNY People uses OpenAI

See how AI literacy scales across the enterprise when learning is built into the work. 20,000 BNY employees have created their own agents to build and update learning content. Get the full story at https://openai.com/index/bny/

YouTube OpenAI2026-01-06

Microsoft’s strategic AI datacenter planning enables seamless, large-scale NVIDIA Rubin deployments

Read how NVIDIA’s next-generation systems slot directly into infrastructure that has anticipated its requirements years ahead of the industry.

Microsoft AI Blog2026-01-05

cloudinfratool

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

A Blog post by NVIDIA on Hugging Face

Hugging Face Blog2026-01-05

tool

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

A Blog post by NVIDIA on Hugging Face

Hugging Face Blog2026-01-05

frameworktool

langchain-xai==1.2.1

この記事は、GitHub上でのlangchain-xaiのバージョン1.2.1のリリースに関する情報を提供しています。このリリースは2023年1月5日に行われ、主な変更点として、出力トータルにおける推論トークンのカウントを修正したことが挙げられています。これにより、推論の精度が向上することが期待されます。前のバージョンである1.2.0からの変更点が記載されており、GitHubの署名付きコミットとして確認されています。 • langchain-xaiのバージョン1.2.1が2023年1月5日にリリースされた • 主な修正点は出力トータルにおける推論トークンのカウントの修正 • この修正により推論の精度が向上することが期待される • リリースはGitHub上で行われ、署名付きコミットとして確認されている

langchain-ai/langchain2026-01-05

releasetool

BNY Sales uses OpenAI

See how AI gives teams more time for what matters most: BNY's clients. With deep research, advisors can cut planning time by 60% and use that time to deliver even more relevant and timely client experiences. Get the full story at https://openai.com/index/bny/

YouTube OpenAI2026-01-05

BNY Legal uses OpenAI

ChatGPT helps members of BNY's Legal team cut contract review time by up to 75%. Get the full story at https://openai.com/index/bny/

YouTube OpenAI2026-01-05

BNY builds “AI for everyone, everywhere” with OpenAI

Leaders at BNY share how they put AI directly into the hands of employees across the firm, powering Eliza 2.0 and enabling secure, responsible AI at scale. Get the full story at https://openai.com/index/bny/

YouTube OpenAI2026-01-05

Navigating Health | with ChatGPT

Every day, millions of people ask ChatGPT about their health – from breaking down medical information, preparing questions for their doctor’s appointments, to helping people manage their overall wellbeing.

YouTube OpenAI2026-01-05

Oxide and Friends Predictions 2026, today at 4pm PT

I joined the Oxide and Friends podcast last year to predict the next 1, 3 and 6 years(!) of AI developments. With hindsight I did very badly, but they're inviting …

Simon Willison's Blog2026-01-05

podcast

AI Gateway support for Claude Code

Use Vercel AI Gateway from Claude Code via the Anthropic-compatible endpoint, with a URL change and AI Gateway usage and cost tracking.

Vercel Blog2026-01-05

apitool

Introducing Falcon H1R 7B

A Blog post by Technology Innovation Institute on Hugging Face

Hugging Face Blog2026-01-05

platform

Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture

A Blog post by Technology Innovation Institute on Hugging Face

Hugging Face Blog2026-01-05

platform

NVIDIA brings agents to life with DGX Spark and Reachy Mini

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog2026-01-05

tool

The November 2025 inflection point

It genuinely feels to me like GPT-5.2 and Opus 4.5 in November represent an inflection point - one of those moments where the models get incrementally better in a way …

Simon Willison's Blog2026-01-04

platform

Helping people write code again

Something I like about our weird new LLM-assisted world is the number of people I know who are coding again, having mostly stopped as they moved into management roles or …

Simon Willison's Blog2026-01-04

tool

🥇Top AI Papers of the Week

The Top AI Papers of the Week (December 29 - January 4)

Elvis Saravia's NLP Blog2026-01-04

platform

Quoting Jaana Dogan

I'm not joking and this isn't funny. We have been trying to build distributed agent orchestrators at Google since last year. There are various options, not everyone is aligned... I …

Simon Willison's Blog2026-01-04

platform

Release v3.38.3

RooCodeIncのRoo-Codeリポジトリのバージョン3.38.3がリリースされ、いくつかの新機能とバグ修正が含まれています。新機能として、Context設定においてサブディレクトリから.roo/rulesおよびAGENTS.mdを再帰的に読み込むオプションが追加されました。また、OAuthリフレッシュトークンの取り扱いを強化することで、Claude Codeへの頻繁なサインインを防ぐ修正が行われました。さらに、native read_fileツールスキーマに最大同時ファイル読み込み数の制限が追加され、TTSのuseEffectにおいてlastMessage.textの型チェックを追加することでランタイムエラーを防ぐ修正も行われました。 • Context設定にサブディレクトリからの再帰的な読み込みオプションを追加 • OAuthリフレッシュトークンの取り扱いを強化し、頻繁なサインインを防止 • native read_fileツールスキーマに最大同時ファイル読み込み数の制限を追加 • TTSのuseEffectにおいてlastMessage.textの型チェックを追加し、ランタイムエラーを防止

RooCodeInc/Roo-Code2026-01-04

releasetool

🤖AI Agents Weekly: LLMs in 2025, YOLO in the Sandbox, Plan Caching for Agents, DeepTutor

LLMs in 2025, YOLO in the Sandbox, Plan Caching for Agents, DeepTutor

Elvis Saravia's NLP Blog2026-01-03

platform

Was Daft Punk Having a Laugh When They Chose the Tempo of Harder, Better, Faster, Stronger?

Depending on how you measure it, the tempo of Harder, Better, Faster, Stronger appears to be 123.45 beats per minute. This is one of those things that's so cool I'm …

Simon Willison's Blog2026-01-03

platform

langchain-core==1.2.6

この記事は、LangChainのコアライブラリのバージョン1.2.6のリリースに関する情報を提供しています。このリリースでは、Pydantic v2メソッドを使用するようにLangChainTracerが更新され、内部ヘルパー関数にドキュメンテーションストリングが追加されました。また、いくつかのドキュメントが更新され、依存関係としてmypyとruffのバージョンがそれぞれ1.19と1.14に引き上げられました。さらに、いくつかの型の修正やスタイルの改善が行われ、特にChatPromptTemplate.from_messagesメソッドでのタプルのサポートが追加されました。テストも強化され、特定のAPI呼び出しにおけるURLエンコーディングの修正が含まれています。 • LangChainTracerがPydantic v2メソッドを使用するように更新された • 内部ヘルパー関数にドキュメンテーションストリングが追加された • mypyとruffの依存関係がそれぞれ1.19と1.14に引き上げられた • ChatPromptTemplate.from_messagesメソッドでのタプルのサポートが追加された • 特定のAPI呼び出しにおけるURLエンコーディングの修正が行われた

langchain-ai/langchain2026-01-02

libraryrelease

Quoting Will Larson

My experience is that real AI adoption on real problems is a complex blend of: domain context on the problem, domain experience with AI tooling, and old-fashioned IT issues. I’m …

Simon Willison's Blog2026-01-02

platform

langchain-xai==1.2.0

この記事は、Langchainの新しいバージョンlangchain-xai==1.2.0のリリースに関するもので、主にバグ修正と機能改善が含まれています。具体的には、引用が一度だけストリーミングされるように修正され、ストリーム使用メタデータがデフォルトでストリーミングされるようになりました。また、シリアル化に関するパッチや、OpenAIのトークンカウントにおけるfunction_callブロックのフィルタリングも行われています。これにより、GPT-5シリーズの最大入力トークン数の更新も含まれています。 • 引用が一度だけストリーミングされるように修正された。 • ストリーム使用メタデータがデフォルトでストリーミングされるようになった。 • シリアル化に関するパッチが適用された。 • OpenAIのトークンカウントにおいてfunction_callブロックがフィルタリングされるようになった。 • GPT-5シリーズの最大入力トークン数が更新された。

langchain-ai/langchain2026-01-02

apireleasetool

December 2025 sponsors-only newsletter

I sent the December edition of my sponsors-only monthly newsletter. If you are a sponsor (or if you start a sponsorship now) you can access a copy here. In the …

Simon Willison's Blog2026-01-02

platform

Quoting Ben Werdmuller

[Claude Code] has the potential to transform all of tech. I also think we’re going to see a real split in the tech industry (and everywhere code is written) between …

Simon Willison's Blog2026-01-02

platform

書籍『作って学ぶAIエージェント』技術レビューのご案内（Member 向け）

現在執筆中の書籍『作って学ぶAIエージェント』について、原稿テキスト段階での技術レビューを実施します。目的本レビューの目的は主に以下です。 * 技術的に致命的な誤りの検出 * 説明の前提不足や誤解を招く表現の指摘 * 現行ツール・APIとの不整合の確認あわせて、書籍の構成・内容に対する要望や改善提案も受け付けます。（これは必須ではなく、可能な範囲で構いません）やってもらいたいこと * 書籍の内容を読み、必要に応じてサンプルコードを実際に動かす * 動作しない箇所、分かりづらい点、前提が不足している点があれば報告する * 内容や構成について要望・違和感があれば共有するレビューの深さや範囲は参加者に委ねます。 ※ 技術レビューにあたって発生する API 利用料金等は、恐れ入りますが各自のご負担となります。レビュー自体は原稿（Markdown／コード）を読む形でも十分に行っていただけます。進め方 * レビュー用に期間限定の Private GitHub Repository を用意しました * 対象は原稿テキスト（Markdown／

Lai.so Blog2026-01-01

apiframeworktool

2025: The year in LLMs

This is the third in my annual series reviewing everything that happened in the LLM space over the past 12 months. For previous years see Stuff we figured out about …

Simon Willison's Blog2025-12-31

platform

Codex cloud is now called Codex web

It looks like OpenAI's Codex cloud (the cloud version of their Codex coding agent) was quietly rebranded to Codex web at some point in the last few days. Here's a …

Simon Willison's Blog2025-12-31

apitool

Release v3.38.2

RooCodeIncのGitHubリポジトリで公開されたリリースv3.38.2では、エージェントスキル仕様に合わせたスキルシステムの調整や、ファイル作成時のパスのトランケートを防ぐ修正が行われた。また、CerebrasのmaxTokensを16384に更新し、レートリミット待機表示の修正も含まれている。さらに、ドキュメント内のTodoリスト動画をコンテキスト管理動画に置き換える変更も行われた。これらの変更は、開発者のコラボレーションによって実施され、リリースは2025年12月31日に行われた。 • エージェントスキル仕様に合わせたスキルシステムの調整 • ファイル作成時のパスのトランケートを防ぐ修正 • CerebrasのmaxTokensを16384に更新 • レートリミット待機表示の修正 • ドキュメント内の動画の更新

RooCodeInc/Roo-Code2025-12-31

releasetool

Quoting Armin Ronacher

[...] The puzzle is still there. What’s gone is the labor. I never enjoyed hitting keys, writing minimal repro cases with little insight, digging through debug logs, or trying to …

Simon Willison's Blog2025-12-30

platform

1.1.0 - 2025-12-30

この記事は、OpenHandsのバージョン1.1.0のリリースノートを提供しています。このリリースでは、CLI認証のためのOAuth 2.0デバイスフローが追加され、変更タブにリフレッシュボタンが追加されました。また、会話パネルに会話をエクスポートするボタンが追加され、Forgejoとの統合も行われました。初期化プロセスがmicromambaからtiniに変更され、tmuxの子プロセスが適切に管理されるようになりました。ローカル（非Docker）実行では、ホスト書き込み可能なパスがデフォルトで使用され、Playwrightのダウンロードが/workspaceから外され、パーミッションエラーが防止され、ファイルの検索が容易になりました。複数のUIおよびパフォーマンスの問題も修正されました。 • OAuth 2.0デバイスフローの追加によりCLI認証が改善された • 変更タブにリフレッシュボタンが追加された • 会話パネルにエクスポートボタンが追加された • Forgejoとの統合が行われた • 初期化プロセスがmicromambaからtiniに変更された • ローカル実行でホスト書き込み可能なパスがデフォルトになった • 複数のUIおよびパフォーマンスの問題が修正された

All-Hands-AI/OpenHands2025-12-30

releasetool

Quoting Liz Fong-Jones

In essence a language model changes you from a programmer who writes lines of code, to a programmer that manages the context the model has access to, prunes irrelevant things, …

Simon Willison's Blog2025-12-30

platform

Release v3.38.1

RooCodeIncのRoo-Codeリポジトリのリリースv3.38.1では、いくつかのバグ修正が行われた。具体的には、ツール結果のフラッシュ処理の修正、OpenAI互換プロバイダー向けのマージツール結果テキストのリバート、最大同時ファイル読み込み制限の強制、ディレクトリに対するread_fileツール使用時のフィードバックメッセージの改善、カスタムツールのIPCスキーマに関する処理の修正、マーケティングページのGitHubリポジトリURLの修正、プライバシーポリシーにおけるセキュリティ設定へのパスの明確化が含まれている。これらの修正により、ツールの安定性とユーザー体験が向上することが期待される。 • ツール結果のフラッシュ処理の修正 • OpenAI互換プロバイダー向けのマージツール結果テキストのリバート • 最大同時ファイル読み込み制限の強制 • ディレクトリに対するread_fileツール使用時のフィードバックメッセージの改善 • カスタムツールのIPCスキーマに関する処理の修正 • マーケティングページのGitHubリポジトリURLの修正 • プライバシーポリシーにおけるセキュリティ設定へのパスの明確化

RooCodeInc/Roo-Code2025-12-29

apireleasetool

Quoting Jason Gorman

The hard part of computer programming isn't expressing what we want the machine to do in code. The hard part is turning human thinking -- with all its wooliness and …

Simon Willison's Blog2025-12-29

tool

The latest AI news we announced in December

Here are Google’s latest AI updates from December 2025

Google AI Blog2025-12-29

apicloudtool

Migrate MLflow tracking servers to Amazon SageMaker AI with serverless MLflow

This post shows you how to migrate your self-managed MLflow tracking server to a MLflow App – a serverless tracking server on SageMaker AI that automatically scales resources based on demand while removing server patching and storage management tasks at no cost. Learn how to use the MLflow Export Import tool to transfer your experiments, runs, models, and other MLflow resources, including instructions to validate your migration's success.

AWS Machine Learning Blog2025-12-29

cloudtool

Build an AI-powered website assistant with Amazon Bedrock

This post demonstrates how to solve this challenge by building an AI-powered website assistant using Amazon Bedrock and Amazon Bedrock Knowledge Bases.

AWS Machine Learning Blog2025-12-29

apicloudtool

AI-first debugging: Tools and techniques for faster root cause analysis

AI-first debugging augments traditional debugging. Learn where AI helps, where it fails, and how to use it safely in production.

logrocket-dev2025-12-29

apitool

Quoting Aaron Levie

Jevons paradox is coming to knowledge work. By making it far cheaper to take on any type of task that we can possibly imagine, we’re ultimately going to be doing …

Simon Willison's Blog2025-12-29

platform

simonw/actions-latest

Today in extremely niche projects, I got fed up of Claude Code creating GitHub Actions workflows for me that used stale actions: actions/setup-python@v4 when the latest is actions/setup-python@v6 for example. …

Simon Willison's Blog2025-12-28

apitool

🥇Top AI Papers of the Week

The Top AI Papers of the Week (December 22-28)

Elvis Saravia's NLP Blog2025-12-28

platform

私のソフトウェア開発を一変させてしまった2025年のAIエージェントをふりかえる

2023年から段階的にAIを開発フローに組み込み、2025年は試行錯誤とツールの大きな変化、そしてエージェント化を経て、私のソフトウェア開発の進め方は明確に変化しました。ここで言う「変化」とは、単に作業が速くなった、便利になったという話ではありません。より具体的には「コードをタイピングする時間よりも、間接作業の比重と抽象的な思考・ロジックが増えた」という意味での変化です。より深刻なのは文字入力回数の増大です。その結果、マイクに向かって話したり、タイピングの練習といったプリミティブな活動を取り入れるようになりました。この変化は私だけのものではありません。Addy Osmaniは『Beyond Vibe Coding』で「開発者の役割はコードを書くことから、コードを指示すること（directing）へシフトしている」と述べ、アーキテクチャやデザインパターンといったシステム思考への集中を説いています。Latent SpaceのSwyxも「ソフトウェアエンジニアの強みは抽象化のレベルを上げることに最も長けている点だ」と指摘しています。この流れに対して「コーディングがつまらなくな

Lai.so Blog2025-12-28

apitool

Substack Network error = security content they don't allow to be sent

I just sent out the latest edition of the newsletter version of this blog. It's a long one! Turns out I wrote a lot of stuff in the past 10 …

Simon Willison's Blog2025-12-28

securitytool

Release v3.38.0

RooCodeIncのRoo-Codeのリリースv3.38.0では、エージェントスキルのサポートが追加され、プロンプト、ツール、リソースの再利用可能なパッケージを通じてRooの機能を拡張できるようになった。また、スラッシュコマンドのフロントマターにオプションのモードフィールドが追加され、コマンドがトリガーされた際に特定のモードに自動的に切り替わることが可能になった。さらに、カスタムツールにnpmパッケージと.envファイルのサポートが追加され、依存関係のインポートや環境変数へのアクセスが可能になった。簡易ファイル読み取りツール機能とOpenRouter Transform機能は削除され、ファイル読み取り体験が簡素化された。 • エージェントスキルのサポート追加により、Rooの機能が拡張可能に • スラッシュコマンドにオプションのモードフィールドを追加 • カスタムツールがnpmパッケージと.envファイルをサポート • 簡易ファイル読み取りツール機能を削除し、体験を簡素化 • OpenRouter Transform機能を削除

RooCodeInc/Roo-Code2025-12-27

releasetool

stagehand/server v3.2.0

GitHub上で公開されたstagehand/serverのバージョン3.2.0がリリースされました。このリリースでは、エージェントからツールを除外する機能が追加され、OpenAIおよびGoogle CUAのための安全確認コールバックのサポートが追加されました。また、アクション失敗時にエージェントキャッシュをリフレッシュする修正も行われました。これにより、エージェントの動作が改善され、より安全にツールを管理できるようになります。 • エージェントからツールを除外する機能が追加された • OpenAIおよびGoogle CUAのための安全確認コールバックがサポートされた • アクション失敗時にエージェントキャッシュをリフレッシュする修正が行われた • エージェントの動作が改善される • ツール管理がより安全になる

browserbase/stagehand2025-12-27

releasetool

@browserbasehq/[email protected]

この記事は、GitHub上で公開された@browserbasehq/stagehandのバージョン3.0.7のリリースに関する情報を提供しています。このリリースには、いくつかのパッチ変更が含まれており、特にハイブリッドモードの実験的な移動や、OpenAIおよびGoogle CUAのための安全確認サポートの追加が注目されます。また、エージェントの動作に関するいくつかのバグ修正や機能改善も行われています。具体的には、エージェントのキャッシュ管理の更新や、ページのホバー機能のサポートが追加されました。これにより、エージェントの動作がより安定し、効率的になることが期待されます。 • ハイブリッドモードの実験的な移動が行われた • OpenAIおよびGoogle CUAのための安全確認サポートが追加された • エージェントのキャッシュ管理が更新された • ページのホバー機能がサポートされた • いくつかのバグ修正が行われた

browserbase/stagehand2025-12-27

releasetool

Pluribus training data

In advocating for LLMs as useful and important technology despite how they're trained I'm beginning to feel a little bit like John Cena in Pluribus. Pluribus spoiler (episode 6) Given …

Simon Willison's Blog2025-12-27

platform

🤖AI Agents Weekly: MiniMax-M2.1, GLM-4.7, MiniMax-M2.1, LaMer Meta-RL, Google's 2025 AI Breakthroughs

MiniMax-M2.1, LLM Coding Workflows, GLM-4.7, MiniMax-M2.1, LaMer Meta-RL, Google's 2025 AI Breakthroughs

Elvis Saravia's NLP Blog2025-12-27

apiframeworktool

Quoting Boris Cherny

A year ago, Claude struggled to generate bash commands without escaping issues. It worked for seconds or minutes at a time. We saw early signs that it may become broadly …

Simon Willison's Blog2025-12-27

tool

langchain-tests==1.1.2

この記事は、Langchainのテストパッケージであるlangchain-testsのバージョン1.1.2のリリースに関する情報を提供しています。このリリースは2023年12月27日に行われ、主な変更点として、テキストスプリッターや標準テスト、CLIに対するruff TCおよびRUF012ルールの追加、ruff ISC001ルールの追加、コア部分のシリアライゼーションパッチが含まれています。これにより、テストの品質向上やバグ修正が図られています。 • langchain-testsのバージョン1.1.2がリリースされた • 主な変更点にはテキストスプリッターや標準テスト、CLIに対するruff TCおよびRUF012ルールの追加がある • ruff ISC001ルールの追加も行われた • コア部分のシリアライゼーションパッチが適用された • これによりテストの品質向上やバグ修正が図られた

langchain-ai/langchain2025-12-27

releasetool

2025 年に読んでよかった本

AI を活用するための技術というのはとりわけ新しいものではなく、過去の知見を基盤として構築されていることが多いです。それゆえに、AI 時代だからこそ基礎的な知識を体系的に学ぶことができる書籍に学ぶことに価値を求めるのです。この記事では 2025 年に読んで特に印象に残った本をいくつか紹介します。

azukiazusa のテックブログ22025-12-27

apisecuritytool

How Rob Pike got spammed with an AI slop "act of kindness"

Rob Pike (that Rob Pike) is furious. Here’s a Bluesky link for if you have an account there and a link to it in my thread viewer if you don’t. …

Simon Willison's Blog2025-12-26

tool

Building Jarvis: MCP and the future of AI with Kent C Dodds [REPEAT]

In this repeat episode, Kent C. Dodds came back on to the podcast with bold ideas and a game-changing vision for the future of AI and web development. In this episode, we dive into the Model Context Protocol (MCP), the power behind Epic AI Pro, and how developers can start building Jarvis-like assistants today. From replacing websites with MCP servers to reimagining voice interfaces and AI security, Kent lays out the roadmap for what's next, and why it matters right now. Don’t miss this fast-paced conversation about the tools and tech reshaping everything.

PodRocket2025-12-25

apicloudtool

2025年 AIエージェント元年を振り返る〜AI駆動なビジネスプロセスへの変革と実践〜

AI ShiftのTECH BLOGです。AI技術の情報や活用方法などをご案内いたします。

AI-Shift Tech Blog2025-12-24

frameworktool

stagehand/server v3.1.3

この記事は、GitHub上で公開されたstagehand/serverのバージョン3.1.3のリリースに関する情報を提供しています。このリリースでは、エージェントのドキュメントの更新や、環境変数からのGOOGLE_API_KEYの読み込みの修正、エージェント評価の追加、スクリーンショットコレクターの更新などが行われました。また、エラー処理やメモリのクリーンアップ、ユニットテストの追加、APIキーの自動読み込みの修正など、さまざまな技術的改善が含まれています。さらに、ハイブリッドCUA + DOMモードの追加や、OpenAPI生成のためのfastify-zod-openapiとzod v4の使用なども新たに導入されています。 • エージェントのドキュメントが更新された • GOOGLE_API_KEYを環境変数から読み込む修正が行われた • エージェント評価の追加が行われた • エラー処理とメモリのクリーンアップが実施された • ユニットテストが追加された • ハイブリッドCUA + DOMモードが新たに追加された • OpenAPI生成にfastify-zod-openapiとzod v4が使用された

browserbase/stagehand2025-12-24

releasetool

Programmatically creating an IDP solution with Amazon Bedrock Data Automation

In this post, we explore how to programmatically create an IDP solution that uses Strands SDK, Amazon Bedrock AgentCore, Amazon Bedrock Knowledge Base, and Bedrock Data Automation (BDA). This solution is provided through a Jupyter notebook that enables users to upload multi-modal business documents and extract insights using BDA as a parser to retrieve relevant chunks and augment a prompt to a foundational model (FM).

AWS Machine Learning Blog2025-12-24

apicloudtool

AI agent-driven browser automation for enterprise workflow management

Enterprise organizations increasingly rely on web-based applications for critical business processes, yet many workflows remain manually intensive, creating operational inefficiencies and compliance risks. Despite significant technology investments, knowledge workers routinely navigate between eight to twelve different web applications during standard workflows, constantly switching contexts and manually transferring information between systems. Data entry and validation tasks […]

AWS Machine Learning Blog2025-12-24

apicloudtool

Agentic QA automation using Amazon Bedrock AgentCore Browser and Amazon Nova Act

In this post, we explore how agentic QA automation addresses these challenges and walk through a practical example using Amazon Bedrock AgentCore Browser and Amazon Nova Act to automate testing for a sample retail application.

AWS Machine Learning Blog2025-12-24

apitool

Optimizing LLM inference on Amazon SageMaker AI with BentoML’s LLM- Optimizer

In this post, we demonstrate how to optimize large language model (LLM) inference on Amazon SageMaker AI using BentoML's LLM-Optimizer to systematically identify the best serving configurations for your workload.

AWS Machine Learning Blog2025-12-24

apicloudtool

Engineering with AI Podcast: The Promise of AI-First Development

Socket CTO Ahmad Nassri shares practical AI coding techniques, tools, and team workflows, plus what still feels noisy and why shipping remains human-l...

Socket2025-12-24

apitool

1.4.0

この記事は、Chromaのバージョン1.4.0のリリースに関する情報を提供しています。このリリースでは、ドキュメントの修正や新機能の追加、バグ修正が行われました。具体的には、BM25のマルチスレッド時の不具合修正、JavaScriptクライアントのベースURL指定のサポート追加、Rustクライアントの新機能追加などが含まれています。また、エージェントメモリガイドやコレクション検索の例もドキュメントに追加されました。これにより、Chromaの機能が向上し、ユーザーにとっての利便性が増しています。 • Chromaのバージョン1.4.0がリリースされた。 • ドキュメントの修正や新機能の追加が行われた。 • BM25のマルチスレッド時の不具合が修正された。 • JavaScriptクライアントにベースURL指定のサポートが追加された。 • Rustクライアントに新機能が追加された。 • エージェントメモリガイドやコレクション検索の例がドキュメントに追加された。

chroma-core/chroma2025-12-24

libraryreleasetool

cli-1.3.0

この記事は、GitHub上のchroma-coreリポジトリにおけるcli-1.3.0のリリースに関する情報を提供しています。このリリースは2023年12月24日に行われ、CLI（コマンドラインインターフェース）の最新バージョンが公開されました。リリースには、6つのアセットが含まれており、GitHubの検証済み署名で作成されたことが記載されています。記事は、リリースの詳細や変更点については触れていませんが、CLIのバージョン管理に関する基本的な情報を提供しています。 • CLIの最新バージョンcli-1.3.0がリリースされた • リリース日は2023年12月24日 • リリースには6つのアセットが含まれている • GitHubの検証済み署名で作成された

chroma-core/chroma2025-12-24

releasetool

Release v3.37.1

RooCodeIncのRoo-Codeのリリースv3.37.1では、いくつかのバグ修正と新機能が追加されました。主な修正点として、OpenAI用のネイティブツール定義をデフォルトで送信するようにし、モデル出力処理時の不正なレスポンスを防ぐためにreasoning_detailsの形状を保持することが挙げられます。また、メッセージの損失を防ぐために、askを待っている間にキューに入ったメッセージを排出する機能も追加されました。新機能としては、空のアシスタントメッセージに対するグレースリトライの追加や、OpenAI互換プロバイダー全体でのmergeToolResultTextの有効化が含まれています。さらに、プロンプト内でのネイティブツール使用ガイダンスを強化し、アカウント中心のサインアップフローを導入してオンボーディング体験を向上させました。 • OpenAI用のネイティブツール定義をデフォルトで送信する修正 • reasoning_detailsの形状を保持することで不正なレスポンスを防ぐ • askを待っている間にキューに入ったメッセージを排出する機能の追加 • 空のアシスタントメッセージに対するグレースリトライの追加 • OpenAI互換プロバイダー全体でのmergeToolResultTextの有効化 • プロンプト内でのネイティブツール使用ガイダンスの強化 • アカウント中心のサインアップフローの導入

RooCodeInc/Roo-Code2025-12-23

releasetool

Exploring the zero operator access design of Mantle

In this post, we explore how Mantle, Amazon's next-generation inference engine for Amazon Bedrock, implements a zero operator access (ZOA) design that eliminates any technical means for AWS operators to access customer data.

AWS Machine Learning Blog2025-12-23

apicloudsecurity

AWS AI League: Model customization and agentic showdown

In this post, we explore the new AWS AI League challenges and how they are transforming how organizations approach AI development. The grand finale at AWS re:Invent 2025 was an exciting showcase of their ingenuity and skills.

AWS Machine Learning Blog2025-12-23

tool

Accelerate Enterprise AI Development using Weights & Biases and Amazon Bedrock AgentCore

In this post, we demonstrate how to use Foundation Models (FMs) from Amazon Bedrock and the newly launched Amazon Bedrock AgentCore alongside W&B Weave to help build, evaluate, and monitor enterprise AI solutions. We cover the complete development lifecycle from tracking individual FM calls to monitoring complex agent workflows in production.

AWS Machine Learning Blog2025-12-23

apicloudtool

Advancing ADHD diagnosis: How Qbtech built a mobile AI assessment Model Using Amazon SageMaker AI

In this post, we explore how Qbtech streamlined their machine learning (ML) workflow using Amazon SageMaker AI, a fully managed service to build, train and deploy ML models, and AWS Glue, a serverless service that makes data integration simpler, faster, and more cost effective. This new solution reduced their feature engineering time from weeks to hours, while maintaining the high clinical standards required by healthcare providers.

AWS Machine Learning Blog2025-12-23

tool

Accelerating your marketing ideation with generative AI – Part 1: From idea to generation with the Amazon Nova foundation models

In this post, the first of a series of three, we focus on how you can use Amazon Nova to streamline, simplify, and accelerate marketing campaign creation through generative AI. We show how Bancolombia, one of Colombia’s largest banks, is experimenting with the Amazon Nova models to generate visuals for their marketing campaigns.

AWS Machine Learning Blog2025-12-23

tool

Google's year in review: 8 areas with research breakthroughs in 2025

2025年のGoogleの年次レビューでは、8つの研究分野における画期的な進展が紹介されています。これらの分野には、AIの進化、持続可能なエネルギー技術、医療の革新、量子コンピューティング、データプライバシーの強化、教育技術の向上、交通の効率化、そして新しいコミュニケーション手段が含まれています。特にAIの進化は、さまざまな業界において新しい可能性を開くものであり、持続可能なエネルギー技術は環境問題への対応に寄与することが期待されています。これらの研究成果は、今後の技術革新や社会の発展に大きな影響を与えるでしょう。 • 2025年における8つの研究分野の進展が報告されている • AIの進化が多くの業界に新しい可能性を提供する • 持続可能なエネルギー技術が環境問題への対応に寄与する • 医療の革新が健康管理の新しいアプローチを生む • 量子コンピューティングが計算能力を飛躍的に向上させる • データプライバシーの強化がユーザーの信頼を高める • 教育技術の向上が学習の質を改善する • 交通の効率化が都市のインフラに貢献する

DeepMind Blog2025-12-23

platform

Google's year in review: 8 areas with research breakthroughs in 2025

This year saw new AI models, transformative products and new breakthroughs in science and robotics.

Google AI Blog2025-12-23

platform

Introducing Visa Intelligent Commerce on AWS: Enabling agentic commerce with Amazon Bedrock AgentCore

In this post, we explore how AWS and Visa are partnering to enable agentic commerce through Visa Intelligent Commerce using Amazon Bedrock AgentCore. We demonstrate how autonomous AI agents can transform fragmented shopping and travel experiences into seamless, end-to-end workflows—from discovery and comparison to secure payment authorization—all driven by natural language.

AWS Machine Learning Blog2025-12-23

apicloudtool

AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems

A Blog post by ServiceNow-AI on Hugging Face

Hugging Face Blog2025-12-23

apitool

How to build agentic AI when your data can’t leave the network

Build privacy-first agentic AI using small language models. Learn how local-first architectures replace cloud LLMs for reasoning, RAG, and orchestration.

logrocket-dev2025-12-23

frameworktool

Pixel Portraits: AI generated trading cards

How Vercel built AI-generated pixel trading cards for Next.js Conf and Ship AI, then turned the same pipeline into a v0 template and festive holiday experiment.

Vercel Blog2025-12-23

tool

Release v3.37.0

RooCodeIncのRoo-Codeリポジトリのバージョン3.37.0がリリースされ、いくつかの新機能と修正が追加されました。新たにMiniMax M2.1が追加され、Minimax思考モデルの環境詳細処理が改善されました。また、Zaiプロバイダー向けに思考モードをサポートするGLM-4.7モデルが追加され、AIワークフローにシームレスに統合できるカスタムツール呼び出しの実験的機能も導入されました。XMLツールプロトコルの選択が非推奨となり、新しいタスクにはネイティブツールフォーマットが強制されます。さらに、OpenAIハンドラーでのストリーミング終了時にtool_call_endイベントを発生させる修正や、MCPツールの厳密モードを無効にする修正なども行われました。 • MiniMax M2.1の追加とMinimax思考モデルの環境詳細処理の改善 • Zaiプロバイダー向けのGLM-4.7モデルの追加 • カスタムツール呼び出しの実験的機能の導入 • XMLツールプロトコルの選択が非推奨に • OpenAIハンドラーでのtool_call_endイベントの修正 • MCPツールの厳密モードを無効にする修正

RooCodeInc/Roo-Code2025-12-23

releasetool

Cooking with Claude

I’ve been having an absurd amount of fun recently using LLMs for cooking. I started out using them for basic recipes, but as I’ve grown more confident in their culinary …

Simon Willison's Blog2025-12-23

apitool

langchain-core==0.3.81

この記事は、GitHub上でのlangchain-coreのバージョン0.3.81のリリースに関する情報を提供しています。このリリースは2023年12月23日に行われ、主な変更点としては、バージョン0.3.80からの修正が含まれています。特に、シリアライゼーションに関するパッチが適用されており、これによりデータの処理や保存に関する問題が改善されることが期待されます。リリースノートには、変更の詳細や関連するコミット情報が記載されています。 • バージョン0.3.81のリリース日: 2023年12月23日 • 主な変更点はシリアライゼーションに関するパッチの適用 • 前のバージョン0.3.80からの修正が含まれている • リリースノートには変更の詳細が記載されている

langchain-ai/langchain2025-12-23

libraryrelease

Forward Deployed Engineer(FDE)職はじめました

AI ShiftのTECH BLOGです。AI技術の情報や活用方法などをご案内いたします。

AI-Shift Tech Blog2025-12-23

apicloudtool

SpecBundle & SpecForge v0.2: Production-Ready Speculative Decoding Models and Framework

<h2><a id="tldr" class="anchor" href="#tldr" aria-hidden="true"><svg aria-hidden="true" class="octicon octicon-link" height="16" version="1.1" viewbox="0 0 1...

LMSYS Blog2025-12-23

librarytool

langchain-core==1.2.5

この記事は、Langchainのコアライブラリのバージョン1.2.5のリリースに関するもので、主にバグ修正と新機能の追加が含まれています。具体的には、シリアライズのパッチ、RunnablePickメソッドの戻り値の修正、@toolデコレーター内のField(description=...)の保持、ツールのargs_schemaからのデフォルト引数のポピュレート、get_buffer_string内でのdeprecatedなfunction_callの代わりにtool_callsの使用、@deprecatedにPEP 702 __deprecated__属性サポートの追加、ツールコールカウントのnull防止、ツールコールカウントの自動カウントと保存、count_tokens_approximatelyの代わりに'approximate'エイリアスの追加、ruffプレビュー規則の修正が行われています。 • バージョン1.2.5のリリースに伴うバグ修正と機能追加 • シリアライズのパッチが適用された • RunnablePickメソッドの戻り値が修正された • @toolデコレーター内のField(description=...)が保持されるようになった • ツールのargs_schemaからデフォルト引数がポピュレートされる • deprecatedなfunction_callの代わりにtool_callsが使用されるようになった • PEP 702 __deprecated__属性が@deprecatedに追加された • ツールコールカウントがnullにならないように修正された • ツールコールカウントが自動的にカウントされ保存されるようになった • 'approximate'エイリアスがcount_tokens_approximatelyの代わりに追加された

langchain-ai/langchain2025-12-22

libraryrelease

Kirana Store with ChatGPT

Turn your hard work into something big, with ChatGPT. 🎧: “Ajib Dastan Hai Yeh” by Lata Mangeshkar [Saregama Music]

YouTube OpenAI2025-12-22

Move Beyond Chain-of-Thought with Chain-of-Draft on Amazon Bedrock

This post explores Chain-of-Draft (CoD), an innovative prompting technique introduced in a Zoom AI Research paper Chain of Draft: Thinking Faster by Writing Less, that revolutionizes how models approach reasoning tasks. While Chain-of-Thought (CoT) prompting has been the go-to method for enhancing model reasoning, CoD offers a more efficient alternative that mirrors human problem-solving patterns—using concise, high-signal thinking steps rather than verbose explanations.

AWS Machine Learning Blog2025-12-22

apitool

Deploy Mistral AI’s Voxtral on Amazon SageMaker AI

In this post, we demonstrate hosting Voxtral models on Amazon SageMaker AI endpoints using vLLM and the Bring Your Own Container (BYOC) approach. vLLM is a high-performance library for serving large language models (LLMs) that features paged attention for improved memory management and tensor parallelism for distributing models across multiple GPUs.

AWS Machine Learning Blog2025-12-22

cloudtool

Enhance document analytics with Strands AI Agents for the GenAI IDP Accelerator

To address the need for businesses to quickly analyze information and unlock actionable insights, we are announcing Analytics Agent, a new feature that is seamlessly integrated into the GenAI IDP Accelerator. With this feature, users can perform advanced searches and complex analyses using natural language queries without SQL or data analysis expertise. In this post, we discuss how non-technical users can use this tool to analyze and understand the documents they have processed at scale with natural language.

AWS Machine Learning Blog2025-12-22

apitool

Build a multimodal generative AI assistant for root cause diagnosis in predictive maintenance using Amazon Bedrock

In this post, we demonstrate how to implement a predictive maintenance solution using Foundation Models (FMs) on Amazon Bedrock, with a case study of Amazon's manufacturing equipment within their fulfillment centers. The solution is highly adaptable and can be customized for other industries, including oil and gas, logistics, manufacturing, and healthcare.

AWS Machine Learning Blog2025-12-22

tool

60 of our biggest AI announcements in 2025

Look back on Google AI news in 2025 across Gemini, Search, Pixel and more products.

Google AI Blog2025-12-22

platformtool

Using Claude in Chrome to navigate out the Cloudflare dashboard

I just had my first success using a browser agent - in this case the Claude in Chrome extension - to solve an actual problem. A while ago I set …

Simon Willison's Blog2025-12-22

apicloudtool

A practical guide to building your RAG pipeline in n8n

Explore how to build a full RAG pipeline in n8n without heavy frameworks. Compare code-first approaches to visual workflows for faster iteration and easier maintenance.

n8n Blog2025-12-22

apitool

Multi-agent systems: Frameworks & step-by-step tutorial

Discover multi-agent AI patterns, communication, costs, risks, and real-world use cases. Compare visual builders like n8n with code-first SDKs.

n8n Blog2025-12-22

apicloudtool

GLM-4.7 available on Vercel AI Gateway

You can now access the Z.ai GLM-4.7 model on Vercel's AI Gateway with no other provider accounts required.

Vercel Blog2025-12-22

apitool

AI SDK 6

Introducing agents, tool execution approval, DevTools, full MCP support, reranking, image editing, and more.

Vercel Blog2025-12-22

apilibrarytool

【promptomatix】LLMのベンチマークスコアを7分、100円であげる

AI ShiftのTECH BLOGです。AI技術の情報や活用方法などをご案内いたします。

AI-Shift Tech Blog2025-12-22

apitool

One in a million: celebrating the customers shaping AI’s future

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。特に、AIを用いたコード補完機能や自動生成機能が強調されており、開発者の生産性を向上させることが期待されています。また、これらのツールは、特定のプログラミング言語やフレームワークに依存せず、幅広い環境で利用可能であることが述べられています。さらに、AI技術の進化に伴い、今後の開発プロセスがどのように変化するかについても考察されています。 • AI技術を活用した新しい開発ツールの紹介 • コード補完機能や自動生成機能が開発者の生産性を向上させる • 特定のプログラミング言語やフレームワークに依存しない • 幅広い環境で利用可能 • AI技術の進化による開発プロセスの変化についての考察

OpenAI Blog2025-12-22

tool

Continuously hardening ChatGPT Atlas against prompt injection

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。特に、AIを用いたコード生成やデバッグ支援の機能が強調されており、開発者が効率的に作業を進めるための具体的な手法が紹介されています。また、これらのツールがどのようにして開発プロセスを改善し、エラーを減少させるかについても詳しく述べられています。さらに、実装方法や使用する際の注意点についても触れられており、実際の開発現場での適用例が示されています。 • AI技術を活用した新しい開発ツールの紹介 • コード生成やデバッグ支援の機能が強調されている • 開発プロセスの改善とエラーの減少に寄与する • 具体的な手法と実装方法が説明されている • 使用時の注意点や適用例についても言及されている

OpenAI Blog2025-12-22

librarytool

🥇Top AI Papers of the Week

The Top AI Papers of the Week (December 15-21)

Elvis Saravia's NLP Blog2025-12-21

platform

🤖AI Agents Weekly: Gemini 3 Flash, GPT Image 1.5, Mistral OCR 3, GPT-5.2-Codex,NVIDIA Nemotron 3, Budget-aware Agent Scaling

Gemini 3 Flash, GPT Image 1.5, Mistral OCR 3, GPT-5.2-Codex,NVIDIA Nemotron 3, Budget-aware Agent Scaling

Elvis Saravia's NLP Blog2025-12-20

apitool

Release v3.36.16

RooCodeIncのGitHubリポジトリで公開されたリリースv3.36.16は、2025年12月19日に行われたもので、主にVS CodeのLanguage Model APIプロバイダーを使用する際に発生する400エラーを解決するためにツールスキーマを正規化する修正が含まれています。この修正は、PR #10221によって提案され、hannesrudolphによって実装されました。 • VS CodeのLanguage Model APIプロバイダー使用時の400エラーを解決するための修正 • ツールスキーマの正規化が行われた • PR #10221によって提案された修正 • hannesrudolphが貢献した

RooCodeInc/Roo-Code2025-12-20

releasetool

v1.13.2

この記事は、Faissのバージョン1.13.2のリリースに関するもので、2025年12月19日に公開されました。このリリースでは、二段階検索フィルタリングの効果を追跡するためのRaBitQStatsの追加や、IndexIVFRaBitQFastScanのマルチビットサポートが含まれています。また、IndexRefinePanoramaの実装や、IndexHNSWFlatPanoramaの再適用、バイナリの後方互換性チェックの追加も行われました。さらに、Intel ScalableVectorSearchのサポートが有効化され、いくつかのバグ修正やドキュメントの更新も行われています。 • 二段階検索フィルタリングの効果を追跡するRaBitQStatsを追加 • IndexIVFRaBitQFastScanにマルチビットサポートを追加 • IndexRefinePanoramaを実装 • IndexHNSWFlatPanoramaの後方互換性を持つ再適用 • Intel ScalableVectorSearchのサポートを有効化 • バイナリの後方互換性チェックを追加 • いくつかのバグ修正とドキュメントの更新

facebookresearch/faiss2025-12-19

releasetool

Quoting Andrej Karpathy

In 2025, Reinforcement Learning from Verifiable Rewards (RLVR) emerged as the de facto new major stage to add to this mix. By training LLMs against automatically verifiable rewards across a …

Simon Willison's Blog2025-12-19

platform

Release v3.36.15

RooCodeIncのGitHubリポジトリで公開されたリリースv3.36.15では、Claude Sonnet 4のための1Mコンテキストウィンドウのベータサポートが追加され、複雑なタスクに対して大幅に大きなコンテキストが可能になった。また、LM StudioおよびQwen-Codeプロバイダーに対するネイティブツール呼び出しサポートが追加され、ローカルモデルとの互換性が向上した。OpenAI互換プロバイダー向けにネイティブツール呼び出しのデフォルトも追加され、より多くの構成でのネイティブ機能呼び出しが拡張された。Requestyプロバイダーに対するネイティブツール呼び出しも有効化され、APIエラーハンドリングが改善され、エラーメッセージが明確になり、ユーザーフィードバックが向上した。チャットエラーからのダウンロード可能なエラー診断が追加され、問題のトラブルシューティングと報告が容易になった。モデルリストの更新が正しく行われるように、モデルのリフレッシュボタンの不具合も修正された。 • 1Mコンテキストウィンドウのベータサポートが追加され、複雑なタスクに対応可能に • LM StudioおよびQwen-Codeプロバイダーに対するネイティブツール呼び出しサポートが追加 • OpenAI互換プロバイダー向けにネイティブツール呼び出しのデフォルトが追加 • Requestyプロバイダーに対するネイティブツール呼び出しが有効化 • APIエラーハンドリングが改善され、エラーメッセージが明確に • チャットエラーからのダウンロード可能なエラー診断が追加 • モデルリストの更新が正しく行われるように不具合が修正された

RooCodeInc/Roo-Code2025-12-19

apireleasetool

langchain-core==1.2.4

この記事は、LangChainのコアライブラリのバージョン1.2.4のリリースに関する情報を提供しています。このリリースでは、LangChainTracerのメタデータにusage_metadataが追加され、イテレータ入力のトレースの永続化が遅延される修正が行われました。また、いくつかのドキュメント文字列の修正も含まれています。これにより、LangChainのトレーシング機能が向上し、ユーザーがより効果的にトレースデータを管理できるようになります。 • LangChainのコアライブラリのバージョン1.2.4がリリースされた。 • LangChainTracerのメタデータにusage_metadataが追加された。 • イテレータ入力のトレースの永続化が遅延される修正が行われた。 • いくつかのドキュメント文字列が修正された。 • トレーシング機能の向上により、ユーザーはトレースデータをより効果的に管理できる。

langchain-ai/langchain2025-12-19

releasetool

Sam Rose explains how LLMs work with a visual essay

Sam Rose is one of my favorite authors of explorable interactive explanations - here's his previous collection. Sam joined ngrok in September as a developer educator. Here's his first big …

Simon Willison's Blog2025-12-19

platform

Introducing SOCI indexing for Amazon SageMaker Studio: Faster container startup times for AI/ML workloads

Today, we are excited to introduce a new feature for SageMaker Studio: SOCI (Seekable Open Container Initiative) indexing. SOCI supports lazy loading of container images, where only the necessary parts of an image are downloaded initially rather than the entire container.

AWS Machine Learning Blog2025-12-19

cloudtool

2025 LLM Year in Review

2025 Year in Review of LLM paradigm changes

Andrej Karpathy's Blog2025-12-19

platform

40 of our most helpful AI tips from 2025

Learn more about the AI tips and tools Google shared in 2025.

Google AI Blog2025-12-19

frameworktool

5 ways AI agents will transform the way we work in 2026

Today, Google Cloud dropped its 2026 AI Agent Trends Report.

Google AI Blog2025-12-19

apicloudtool

Introducing GPT-5.2-Codex

The latest in OpenAI's Codex family of models (not the same thing as their Codex CLI or Codex Cloud coding agent tools). GPT‑5.2-Codex is a version of GPT‑5.2⁠ further optimized …

Simon Willison's Blog2025-12-19

platform

Release v3.36.14

RooCodeIncのRoo-Codeリポジトリでのリリースv3.36.14では、Claudeモデルに対するネイティブツール呼び出しサポートがVertex AIに追加され、ツール間のインタラクションがより効率的かつ信頼性の高いものになりました。また、OpenAIとの互換性を確保するためにJSONスキーマのフォーマット値のストリッピングに関する問題が修正され、ツールの実行に失敗した際のエラーハンドリングが改善され、優雅なリトライメカニズムが導入されました。これにより、ツールが失敗した場合でも信頼性が向上しました。 • Claudeモデルに対するネイティブツール呼び出しサポートが追加された • OpenAIとの互換性のためにJSONスキーマのフォーマット値の問題が修正された • ツールの実行失敗時のエラーハンドリングが改善された • 優雅なリトライメカニズムが導入され、信頼性が向上した

RooCodeInc/Roo-Code2025-12-19

apireleasetool

Agent Skills

Anthropic have turned their skills mechanism into an "open standard", which I guess means it lives in an independent agentskills/agentskills GitHub repository now? I wouldn't be surprised to see this …

Simon Willison's Blog2025-12-19

apitool

Power Up Diffusion LLMs: Day‑0 Support for LLaDA 2.0

<h2><a id="tldr" class="anchor" href="#tldr" aria-hidden="true"><svg aria-hidden="true" class="octicon octicon-link" height="16" version="1.1" viewbox="0 0 1...

LMSYS Blog2025-12-19

librarytool

Google Research 2025: Bolder breakthroughs, bigger impact

2025年、Google Researchは研究の加速を実現し、製品、科学、社会に影響を与える画期的な成果を上げた。AIの基盤となる技術の進展により、生成モデルはより効率的で事実に基づき、多言語かつ多文化に対応するようになった。新しいアーキテクチャやアルゴリズムの研究が進み、科学的発見を加速するAIツールやエージェントモデルが開発された。量子コンピューティングの実用化に向けた量子のブレークスルーや、地球科学の研究が進展し、気候変動、健康、教育といった社会的優先事項にも取り組んだ。特に、Gemini 3は事実性の面で最高の性能を誇り、ユーザーはGoogleの製品が世界の知識に基づいた出力を提供することを信頼できる。 • Google Researchは2025年に研究の加速を実現し、製品や社会に影響を与える成果を上げた。 • AIの基盤技術の進展により、生成モデルが効率的で事実に基づくものになった。 • 新しいアーキテクチャやアルゴリズムの研究が進み、科学的発見を加速するAIツールが開発された。 • 量子コンピューティングの実用化に向けたブレークスルーが達成された。 • Gemini 3は事実性の面で最高の性能を誇り、ユーザーは信頼できる出力を得られる。

Google Research2025-12-18

platform

langgraph-sdk==0.3.1

この記事は、langgraph-sdkのバージョン0.3.1のリリースに関する情報を提供しています。このリリースは2022年12月18日に行われ、主な変更点として、sdkのバージョンを0.3.1に引き上げたことが挙げられます。また、モデルタイプ特有のカスタムJSON暗号化注釈を削除し、キー保持の制限についての文書化が行われました。これにより、ユーザーはよりシンプルな実装が可能となり、暗号化に関する理解が深まることが期待されます。 • langgraph-sdkのバージョン0.3.1がリリースされた • 主な変更点はモデルタイプ特有のカスタムJSON暗号化注釈の削除 • キー保持の制限についての文書化が行われた • ユーザーはよりシンプルな実装が可能になる • 暗号化に関する理解が深まることが期待される

langchain-ai/langgraph2025-12-18

releasetool

Release v3.36.13

RooCodeIncのRoo-Codeリポジトリのリリースv3.36.13では、いくつかの重要な変更が行われた。デフォルトのツールプロトコルがXMLからネイティブに変更され、信頼性とパフォーマンスが向上した。また、VS Codeの言語モデルAPIプロバイダーに対するネイティブツールサポートが追加された。タスクツールプロトコルがロックされ、タスクの再開時に同じプロトコルが使用されることが保証されるようになった。さらに、diff編集機能を改善するために、edit_fileツールのエイリアスが実際のedit_fileツールに置き換えられた。LiteLLMルーターモデルの修正も行われ、ネイティブツール呼び出しサポートのためにデフォルトモデル情報が統合された。最後に、連続的なエラーを追跡するためのPostHog例外追跡が追加され、エラーモニタリングが改善された。 • デフォルトのツールプロトコルがXMLからネイティブに変更され、信頼性とパフォーマンスが向上した。 • VS Codeの言語モデルAPIプロバイダーに対するネイティブツールサポートが追加された。 • タスクツールプロトコルがロックされ、タスクの再開時に同じプロトコルが使用されることが保証された。 • diff編集機能を改善するために、edit_fileツールのエイリアスが実際のedit_fileツールに置き換えられた。 • LiteLLMルーターモデルの修正が行われ、ネイティブツール呼び出しサポートのためにデフォルトモデル情報が統合された。 • 連続的なエラーを追跡するためのPostHog例外追跡が追加され、エラーモニタリングが改善された。

RooCodeInc/Roo-Code2025-12-18

releasetool

What is sycophancy in AI models?

Learn what AI researchers mean when they talk about sycophancy, when it's more likely to show up in conversations, and tactics you can use to steer AI towards truth.

YouTube Anthropic2025-12-18

langchain-core==1.2.3

この記事は、Langchainのコアライブラリのバージョン1.2.3のリリースに関する情報を提供しています。このリリースでは、主に2つの変更が行われました。1つ目は、convert_to_openai_messages関数において未知のブロックを許可する修正が加えられたことです。2つ目は、CI（継続的インテグレーション）チェックが追加され、ロックファイルの更新が必要かどうかを確認する機能が実装されたことです。これにより、開発者は依存関係の管理が容易になり、よりスムーズな開発プロセスが期待されます。 • バージョン1.2.3のリリースに関する情報 • convert_to_openai_messages関数の修正により未知のブロックを許可 • CIチェックの追加によりロックファイルの更新確認が可能に • 依存関係の管理が容易になる • 開発プロセスのスムーズさが向上する

langchain-ai/langchain2025-12-18

libraryreleasetool

Let Claude handle work in your browser

See Claude for Chrome handle three complete workflows in your browser. Pull data from dashboards into one analysis doc Address slide comments automatically Build with Claude Code, test in Chrome Claude for Chrome is a browser extension that lets Claude see, click, type, and navigate web pages. Try it: claude.com/chrome

YouTube Anthropic2025-12-18

Deploying Smarter: Hardware-Software Co-design in PyTorch

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特に自然言語処理を用いた機能が強化されています。具体的には、開発者が自然言語で指示を出すと、AIがそれに基づいてコードを生成することが可能です。また、ツールは既存の開発環境に簡単に統合できるよう設計されており、ユーザーは特別な設定を行うことなくすぐに利用を開始できます。さらに、AIによるコード生成は、開発の効率を大幅に向上させることが期待されています。 • AI技術を活用した新しい開発ツールの紹介 • 自然言語での指示に基づいてコードを生成する機能 • 既存の開発環境への簡単な統合 • 開発効率の向上が期待される • 自然言語処理を用いた強化された機能

PyTorch Blog2025-12-18

tool

langchain-openai==1.1.6

この記事は、GitHub上でのlangchain-openaiライブラリのバージョン1.1.6のリリースに関する情報を提供しています。このリリースでは、gpt-5シリーズの最大入力トークン数が更新されました。具体的には、バージョン1.1.5からの変更点として、gpt-5シリーズに関連するトークン数の制限が見直され、より多くのトークンを処理できるようになっています。これにより、ユーザーはより長い入力を使用してモデルを活用できるようになります。 • gpt-5シリーズの最大入力トークン数が更新された • バージョン1.1.5からの変更点がある • ユーザーはより長い入力を使用できるようになる • リリースはGitHubで行われた • リリース日は2023年12月18日である

langchain-ai/langchain2025-12-18

apilibraryrelease

Build and deploy scalable AI agents with NVIDIA NeMo, Amazon Bedrock AgentCore, and Strands Agents

This post demonstrates how to use the powerful combination of Strands Agents, Amazon Bedrock AgentCore, and NVIDIA NeMo Agent Toolkit to build, evaluate, optimize, and deploy AI agents on Amazon Web Services (AWS) from initial development through production deployment.

AWS Machine Learning Blog2025-12-18

apiframeworktool

Building Apps for ChatGPT with Apollo MCP Server and Apollo Client

This post will introduce a tutorial on how to build an app for ChatGPT using Apollo Client and Apollo MCP Server. You’ll have what you need to get started with our opinionated stack for building these apps.

apollo-blog2025-12-18

apiframeworktool

Bi-directional streaming for real-time agent interactions now available in Amazon Bedrock AgentCore Runtime

In this post, you will learn about bi-directional streaming on AgentCore Runtime and the prerequisites to create a WebSocket implementation. You will also learn how to use Strands Agents to implement a bi-directional streaming solution for voice agents.

AWS Machine Learning Blog2025-12-18

apitool

AI transformation in financial services: 5 predictors for success in 2026

Financial services businesses are busily adopting agentic AI, with Frontier Firms leading transformation. Learn more.

Microsoft AI Blog2025-12-18

platformtool

You can now verify Google AI-generated videos in the Gemini app.

We’re expanding our content transparency tools to help you more easily identify AI-generated content. You can now check if a video was edited or created with Google AI d…

Google AI Blog2025-12-18

apitool

Inside Kaggle's AI Agents intensive course with Google

Kaggle’s AI Agents Intensive with Google brought learners together in a no-cost course to build and deploy the next frontier of AI.

Google AI Blog2025-12-18

platformtool

I tested 5 AI CLI tools: Here’s how they stack up

A hands-on comparison of five AI coding CLIs, tested by building the same React Todo app.

logrocket-dev2025-12-18

apitool

We gave AI control of a real business

For a large part of 2025, we ran Project Vend: an experiment where we let Claude manage a small business in the Anthropic office. We learned a lot from how close it was to success—and the curious ways that it failed—about the plausible, strange, not-too-distant future in which AI models might autonomously run things in the real economy. The shopkeeper (who we named Claudius) had to source products, set prices, manage inventory, and deal with customers. Things got really, really weird. Read more about the experiment: https://www.anthropic.com/research/project-vend-2 0:00 Background on Project Vend 0:35 How a transaction works 1:27 Claudius's naïveté 2:29 An identity crisis 3:57 The CEO agent 5:04 Conclusion

YouTube Anthropic2025-12-18

Evaluating chain-of-thought monitorability

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特に自然言語処理を用いた機能が強化されています。具体的には、開発者が自然言語で指示を出すと、AIがそれに基づいてコードを生成することが可能です。また、ツールは既存の開発環境に統合できるよう設計されており、導入が容易です。これにより、開発の効率が向上し、エラーの削減が期待されます。さらに、ユーザーからのフィードバックを基に継続的に改善される点も強調されています。 • AIを活用した新しい開発ツールの紹介 • 自然言語での指示に基づいてコードを生成する機能 • 既存の開発環境への統合が容易 • 開発効率の向上とエラー削減が期待される • ユーザーからのフィードバックを基にした継続的な改善

OpenAI Blog2025-12-18

tool

Deepening our collaboration with the U.S. Department of Energy

OpenAI Blog2025-12-18

tool

AI literacy resources for teens and parents

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特に自然言語処理を用いた機能が強化されています。具体的には、開発者が自然言語で指示を出すと、AIがそれに基づいてコードを生成することが可能です。また、ツールは既存の開発環境に統合できるよう設計されており、使いやすさが考慮されています。さらに、AIによるコード生成は、開発の効率を大幅に向上させることが期待されています。 • AI技術を活用した新しい開発ツールの紹介 • 自然言語での指示に基づいてコードを生成する機能 • 既存の開発環境への統合が可能 • 開発効率の向上が期待される • 使いやすさが考慮された設計

OpenAI Blog2025-12-18

tool

Updating our Model Spec with teen protections

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。特に、AIを用いたコード補完機能や自動生成機能が強調されており、開発者の生産性を向上させることが期待されています。また、これらのツールは、特定のプログラミング言語やフレームワークに対応しており、ユーザーが簡単に導入できるように設計されています。さらに、AI技術の進化により、従来の開発プロセスが大きく変わる可能性があることも指摘されています。 • AIを活用したコード補完機能がある • 自動生成機能により開発者の生産性が向上する • 特定のプログラミング言語やフレームワークに対応 • ユーザーが簡単に導入できる設計 • AI技術の進化が開発プロセスを変える可能性がある

OpenAI Blog2025-12-18

librarytool

Release v3.36.12

この記事は、RooCodeIncのGitHubリポジトリにおけるリリースv3.36.12に関する情報を提供しています。このリリースでは、BedrockエンベッダーにuserAgentAppIdを追加し、コードインデックスの改善が行われました。また、OpenAIとGeminiのツール設定が更新され、モデルの動作が向上しました。さらに、PostHogエラーグルーピングのためにJSONペイロードからエラーメッセージを抽出する機能が追加されました。これらの変更は、開発者がより効率的にエラーを管理し、ツールのパフォーマンスを向上させることを目的としています。 • BedrockエンベッダーにuserAgentAppIdを追加し、コードインデックスを改善 • OpenAIとGeminiのツール設定を更新し、モデルの動作を向上 • PostHogエラーグルーピングのためにJSONペイロードからエラーメッセージを抽出する機能を追加 • これらの変更により、開発者はエラー管理が効率的になる • リリースは2025年12月18日に行われた

RooCodeInc/Roo-Code2025-12-18

releasetool

Inside PostHog: How SSRF, a ClickHouse SQL Escaping 0day, and Default PostgreSQL Credentials Formed an RCE Chain

Mehmet Ince describes a very elegant chain of attacks against the PostHog analytics platform, combining several different vulnerabilities (now all reported and fixed) to achieve RCE - Remote Code Execution …

Simon Willison's Blog2025-12-18

apidatabasesecurity

Counsel Health Is Multiplying the World's Clinical Capacity with Mastra

How Counsel Health is using Mastra to multiply the world's clinical capacity.

Mastra Blog2025-12-18

aiapicloud

Watch a podcast discussion about Gemini 3 and the future of Search.

Learn how Gemini 3 powers Google Search with Generative UI, Nano Banana, and interactive graphics.

Google AI Blog2025-12-18

platform

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Hugging Face Blog2025-12-18

librarytool

Introducing GPT-5.2-Codex

OpenAI Blog2025-12-18

tool

Addendum to GPT-5.2 System Card: GPT-5.2-Codex

この記事では、最新のAI技術を活用した新しい開発ツールについて説明しています。このツールは、開発者がコードを書く際にAIの支援を受けることができるもので、特にJavaScriptやTypeScriptのプロジェクトにおいて効果を発揮します。具体的には、コードの自動生成やエラーチェック、最適化提案などの機能があり、開発の効率を大幅に向上させることが期待されています。また、ツールの導入方法や設定手順についても詳しく解説されており、初心者でも簡単に利用できるよう配慮されています。最後に、AI技術の進化に伴い、今後の開発環境がどのように変化していくかについての展望も述べられています。 • AI技術を活用した新しい開発ツールの紹介 • JavaScriptやTypeScriptプロジェクトでの効果 • コードの自動生成、エラーチェック、最適化提案機能 • 初心者でも簡単に利用できる導入方法と設定手順 • 今後の開発環境の変化に関する展望

OpenAI Blog2025-12-18

librarytool

Introducing GPT-5.2-Codex

OpenAI Blog2025-12-18

tool

Building a Code Review system that uses prod data to predict bugs

See how Sentry’s AI Code Review works—using real Sentry data to predict bugs in your PRs, cut noise, and suggest fixes before you ship.

sentry-blog2025-12-18

apitool

AoAH Day 15: Porting a complete HTML5 parser and browser test suite

Anil Madhavapeddy is running an Advent of Agentic Humps this year, building a new useful OCaml library every day for most of December. Inspired by Emil Stenström's JustHTML and my …

Simon Willison's Blog2025-12-17

apilibrarytool

Gemini 3 Flash

It continues to be a busy December, if not quite as busy as last year. Today’s big news is Gemini 3 Flash, the latest in Google’s “Flash” line of faster …

Simon Willison's Blog2025-12-17

tool

Release v3.36.11

RooCodeIncのRoo-Codeのリリースv3.36.11では、Claude Code Providerのネイティブツール呼び出しのサポートが追加され、ツール実行のパフォーマンスと信頼性が向上しました。また、Z.aiモデルに対してネイティブツール呼び出しがデフォルトで有効化され、モデルの互換性が改善されました。OpenAI互換プロバイダーに対してもネイティブツールがデフォルトで有効化され、ツール呼び出しのサポートが強化されました。さらに、BedrockとOpenAIの厳密モードにおけるMCPツールスキーマの正規化や、Bedrock互換のためのツール名からのドットとコロンの削除、ネイティブツールが無効な場合のtool_resultのXMLテキストへの変換などの修正が行われました。AWS GovCloudおよび中国地域のARNsのサポートも追加され、地域的なサポートが拡大しました。 • Claude Code Providerのネイティブツール呼び出しのサポート追加 • Z.aiモデルに対するネイティブツール呼び出しのデフォルト有効化 • OpenAI互換プロバイダーに対するネイティブツールのデフォルト有効化 • MCPツールスキーマの正規化による互換性向上 • Bedrock互換のためのツール名からのドットとコロンの削除 • ネイティブツール無効時のtool_resultのXMLテキスト変換 • AWS GovCloudおよび中国地域のARNsのサポート追加

RooCodeInc/Roo-Code2025-12-17

apireleasetool

langchain-openai==1.1.5

この記事は、GitHub上でのlangchain-openaiライブラリのバージョン1.1.5のリリースに関する情報を提供しています。このリリースは2023年12月17日に行われ、主な変更点として、chunk_positionの設定に関してlangchain-coreに依存する修正が含まれています。これにより、ライブラリの機能が向上し、より安定した動作が期待されます。リリースノートには、前のバージョン1.1.4からの変更点が明記されており、開発者が新しい機能や修正を把握しやすくなっています。 • langchain-openaiライブラリのバージョン1.1.5がリリースされた • リリース日は2023年12月17日 • chunk_positionの設定に関してlangchain-coreに依存する修正が行われた • この修正によりライブラリの機能が向上した • リリースノートには前のバージョンからの変更点が記載されている

langchain-ai/langchain2025-12-17

libraryrelease

Microsoft named a Leader in Gartner® Magic Quadrant™ for AI Application Development Platforms

Learn how Microsoft Foundry—our unified, interoperable AI platform—is enabling developers to build faster, smarter, and safer AI apps.

Microsoft AI Blog2025-12-17

apicloudtool

Binti helps social workers license foster families faster with Claude

Binti is transforming child welfare by helping social workers license foster and adoptive families faster. With 400,000 children in U.S. foster care, Binti integrated Claude to reduce paperwork from weeks to hours—shrinking approval timelines by 18%.

YouTube Anthropic2025-12-17

The AI software engineer in 2026

AI tools are everywhere, but trust is falling. Learn how engineers become orchestrators in 2026, choosing which agents to scaffold, ship, and maintain.

Builder.io Blog2025-12-17

apiframeworktool

Tracking and managing assets used in AI development with Amazon SageMaker AI

In this post, we'll explore the new capabilities and core concepts that help organizations track and manage models development and deployment lifecycles. We will show you how the features are configured to train models with automatic end-to-end lineage, from dataset upload and versioning to model fine-tuning, evaluation, and seamless endpoint deployment.

AWS Machine Learning Blog2025-12-17

apitool

Track machine learning experiments with MLflow on Amazon SageMaker using Snowflake integration

In this post, we demonstrate how to integrate Amazon SageMaker managed MLflow as a central repository to log these experiments and provide a unified system for monitoring their progress.

AWS Machine Learning Blog2025-12-17

apitool

Period

Categories

Sources (44)

Release v3.39.2

Crossmodal search with Amazon Nova Multimodal Embeddings

stagehand/server v3.3.0

langgraph-sdk==0.3.2

Supercharging LLMs: Scalable RL with torchforge and Weaver

Accelerating LLM inference with post-training weight and activation using AWQ and GPTQ on Amazon SageMaker AI

langchain-core==1.2.7

How Beekeeper optimized user personalization with Amazon Bedrock

Sentiment Analysis with Text and Audio Using AWS Generative AI Services: Approaches, Challenges, and Solutions

Architecting TrueLook’s AI-powered construction safety system on Amazon SageMaker AI

This Year with ChatGPT

OpenAI and SoftBank Group partner with SB Energy

Warp Specialization in Triton: Design and Roadmap

Datadog uses Codex for system-level code review

Release v3.39.1

Insecure Agents Podcast: Certified Patches, Supply Chain Security, and AI Agents

langchain==1.2.3

PyTorch 2.9: FlexAttention Optimization Practice on Intel GPUs

LLM predictions for 2026, shared with Oxide and Friends

Scaling medical content review at Flo Health using Amazon Bedrock (Part 1)

Preparing for Appointments | with ChatGPT

Detect and redact personally identifiable information using Amazon Bedrock Data Automation and Guardrails

Speed meets scale: Load testing SageMakerAI endpoints with Observe.AI’s testing tool

AI's limited self-knowledge

How Google Got Its Groove Back and Edged Ahead of OpenAI

Release v3.39.0

Netomi’s lessons for scaling agentic systems into the enterprise

15 best n8n practices for deploying AI agents in production

OpenAI for Healthcare

v0.18.4 Patch Release

langchain==1.2.2

Infosys partners with Cognition to expand engineering capacity and help scale its enterprise business

Securely connecting your information and apps with ChatGPT Health

Personalized nutrition tips with ChatGPT

Helping you choose the right insurance plan for you with ChatGPT

Preparing for a doctor’s appointment with ChatGPT

Quoting Adam Wathan

Understanding your Scan Results | with ChatGPT

Best AI Coding Tools for Developers in 2026

How we made v0 an effective coding agent

How Tolan builds voice-first AI with GPT-5.1

3行で始める文章検索 ― txtai入門

Quoting Robin Sloan

Understanding Inflammation | with ChatGPT

langchain==1.2.1

Introducing ChatGPT Health

A field guide to sandboxes for AI

Small Yet Mighty: Improve Accuracy In Multimodal Search and Visual Document Retrieval with Llama Nemotron RAG Models

Balancing Motherhood | with ChatGPT

Don’t ship another chat UI. Build real AI with AG-UI

Commonwealth Bank of Australia builds AI fluency at scale

BNY People uses OpenAI

Microsoft’s strategic AI datacenter planning enables seamless, large-scale NVIDIA Rubin deployments

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

Generalist Robot Policy Evaluation in Simulation with NVIDIA Isaac Lab-Arena and LeRobot

langchain-xai==1.2.1

BNY Sales uses OpenAI

BNY Legal uses OpenAI

BNY builds “AI for everyone, everywhere” with OpenAI

Navigating Health | with ChatGPT

Oxide and Friends Predictions 2026, today at 4pm PT

AI Gateway support for Claude Code

Introducing Falcon H1R 7B

Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture

NVIDIA brings agents to life with DGX Spark and Reachy Mini

The November 2025 inflection point

Helping people write code again

🥇Top AI Papers of the Week

Quoting Jaana Dogan

Release v3.38.3

🤖AI Agents Weekly: LLMs in 2025, YOLO in the Sandbox, Plan Caching for Agents, DeepTutor

Was Daft Punk Having a Laugh When They Chose the Tempo of Harder, Better, Faster, Stronger?

langchain-core==1.2.6

Quoting Will Larson

langchain-xai==1.2.0

December 2025 sponsors-only newsletter

Quoting Ben Werdmuller