ランタイム設定 (Runtime Configuration)¶

ADKでサポートPython v0.1.0TypeScript v0.2.0Go v0.1.0Java v0.1.0Kotlin v0.1.0

RunConfig は、streaming mode、speech setting、LLM call limit、live agent option など、エージェントのランタイム動作を制御します。デフォルト動作を上書きするには、 RunConfig を runner.run_async() または runner.run_live() に渡します。

PythonTypeScriptGoJavaKotlin

from google.adk.agents.run_config import RunConfig, StreamingMode

config = RunConfig(
    streaming_mode=StreamingMode.SSE,
    max_llm_calls=200,
)

async for event in runner.run_async(
    ...,
    run_config=config,
):
    ...

import { RunConfig, StreamingMode } from '@google/adk';

const config: RunConfig = {
  streamingMode: StreamingMode.SSE,
  maxLlmCalls: 200,
};

import "google.golang.org/adk/agent"

config := agent.RunConfig{
    StreamingMode: agent.StreamingModeSSE,
}

import com.google.adk.agents.RunConfig;
import com.google.adk.agents.RunConfig.StreamingMode;

RunConfig config = RunConfig.builder()
    .streamingMode(StreamingMode.SSE)
    .maxLlmCalls(200)
    .build();

val config =
    RunConfig(
        streamingMode = StreamingMode.SSE,
    )

// Pass it to runner.runAsync
// runner.runAsync(..., runConfig = config)

セッションとコンテキストの管理¶

ADKでサポートPython

Long-running session では、どれだけの history を load するか、context window を圧縮するかを制御できます。

get_session_config: session を load するときに取得する event を制限します。 invocation ごとに event history 全体を load しないよう、num_recent_events または after_timestamp を使います。
context_window_compression: LLM input の context window compression を有効にします。 session が model context limit に近づく場合に便利です。

Python

from google.adk.agents.run_config import RunConfig
from google.adk.sessions.base_session_service import GetSessionConfig

config = RunConfig(
    get_session_config=GetSessionConfig(num_recent_events=50),
)

ストリーミングの有効化¶

エージェントがレスポンスを配信する方法を制御するには、streaming_mode パラメータを設定します。

StreamingMode.NONE (デフォルト): runner は turn ごとに 1 つの完全なレスポンスを返します。CLI tool、batch processing、synchronous workflow に適しています。
StreamingMode.SSE: Server-Sent Events streaming です。LLM が生成している間、 runner が partial event を yield し、typewriter-style UI や real-time chat display を実現できます。
StreamingMode.BIDI: bidirectional streaming 用に予約されていますが、標準の run_async() path では 使用されません。Bidirectional streaming には runner.run_live() を使ってください。

StreamingMode.SSE とともに support_cfc=True を設定すると、Compositional Function Calling(CFC) を有効にできます。CFC により、model は function call を動的に構成して実行でき、内部では Live API を使用します。

Experimental

CFC サポートは実験的であり、将来のリリースで API や動作が変更される可能性があります。

PythonTypeScriptGoJavaKotlin

from google.adk.agents.run_config import RunConfig, StreamingMode

config = RunConfig(
    streaming_mode=StreamingMode.SSE,
    support_cfc=True,
    max_llm_calls=150,
)

import { RunConfig, StreamingMode } from '@google/adk';

const config: RunConfig = {
    streamingMode: StreamingMode.SSE,
    supportCfc: true,
    maxLlmCalls: 150,
};

import "google.golang.org/adk/agent"

config := agent.RunConfig{
    StreamingMode: agent.StreamingModeSSE,
}

import com.google.adk.agents.RunConfig;
import com.google.adk.agents.RunConfig.StreamingMode;

RunConfig config = RunConfig.builder()
    .streamingMode(StreamingMode.SSE)
    .maxLlmCalls(150)
    .build();

val streamingConfig =
    RunConfig(
        streamingMode = StreamingMode.SSE,
    )

音声と発話の設定¶

ADKでサポートPythonTypeScriptJava

Voice-enabled agent では、speech synthesis、audio transcription、response modality を設定します。

speech_config: speech output の voice と language を設定します。たとえば en-US と "Kore" voice を使用できます。
response_modalities: output format を制御します。発話と text 返却の両方を行うエージェントでは ["AUDIO", "TEXT"] に設定します。
output_audio_transcription / input_audio_transcription: model の audio output とユーザーの audio input の transcription を有効にします。Python ではどちらもデフォルトが AudioTranscriptionConfig() です。

PythonTypeScriptJava

from google.adk.agents.run_config import RunConfig, StreamingMode
from google.genai import types

config = RunConfig(
    speech_config=types.SpeechConfig(
        language_code="en-US",
        voice_config=types.VoiceConfig(
            prebuilt_voice_config=types.PrebuiltVoiceConfig(
                voice_name="Kore"
            )
        ),
    ),
    response_modalities=["AUDIO", "TEXT"],
    streaming_mode=StreamingMode.SSE,
    max_llm_calls=1000,
)

import { RunConfig, StreamingMode } from '@google/adk';
import { Modality } from '@google/genai';

const config: RunConfig = {
    speechConfig: {
        languageCode: "en-US",
        voiceConfig: {
            prebuiltVoiceConfig: {
                voiceName: "Kore"
            }
        },
    },
    responseModalities: [Modality.AUDIO, Modality.TEXT],
    streamingMode: StreamingMode.SSE,
    maxLlmCalls: 1000,
};

import com.google.adk.agents.RunConfig;
import com.google.adk.agents.RunConfig.StreamingMode;
import com.google.common.collect.ImmutableList;
import com.google.genai.types.Modality;
import com.google.genai.types.PrebuiltVoiceConfig;
import com.google.genai.types.SpeechConfig;
import com.google.genai.types.VoiceConfig;

RunConfig runConfig =
    RunConfig.builder()
        .streamingMode(StreamingMode.SSE)
        .maxLlmCalls(1000)
        .responseModalities(ImmutableList.of(new Modality(Modality.Known.AUDIO), new Modality(Modality.Known.TEXT)))
        .speechConfig(
            SpeechConfig.builder()
                .voiceConfig(
                    VoiceConfig.builder()
                        .prebuiltVoiceConfig(
                            PrebuiltVoiceConfig.builder().voiceName("Kore").build())
                        .build())
                .languageCode("en-US")
                .build())
        .build();

Live agent の設定¶

ADKでサポートPythonTypeScript

runner.run_live() を使用するときは、次の追加パラメータで real-time behavior を設定します。

realtime_input_config: ユーザーから audio input を受け取る方法を設定します。
proactivity: model が proactive に応答し、関係のない input を無視できるようにします。
enable_affective_dialog: True の場合、model がユーザーの感情を検出し、それに応じて tone を調整します。
avatar_config: live agent 用の avatar を設定します。
session_resumption: disconnect をまたいだ transparent session resumption を有効にします。
save_live_blob: True の場合、live audio と video data を session と artifact service に保存します。
tool_thread_pool_config: event loop がユーザーの interruption に responsive であり続けるよう、 tool execution を background thread pool で実行します。

すべてのパラメータがすべての言語で利用できるわけではありません。言語別の詳細は API reference を参照してください。

PythonTypeScript

from google.adk.agents.run_config import RunConfig, ToolThreadPoolConfig

config = RunConfig(
    save_live_blob=True,
    tool_thread_pool_config=ToolThreadPoolConfig(max_workers=8),
)

Thread pool and the GIL

Thread pool は blocking I/O と、GIL を release する C extension(例: time.sleep()、network call、numpy)に役立ちます。純粋な Python CPU-bound code には役立ちません。GIL が Python bytecode の真の並列実行を妨げるためです。

import { RunConfig } from '@google/adk';

const config: RunConfig = {
    enableAffectiveDialog: true,
    proactivity: {
        proactiveAudio: true,
    },
};

ランタイム制限とデバッグの設定¶

次のパラメータで runtime guardrail と debugging を制御します。

max_llm_calls: run ごとの LLM call 合計数を制限します(デフォルト: 500)。0 または負の値は無制限の呼び出しを意味しますが、本番環境では推奨されません。sys.maxsize 以上の値はエラーになります。
save_input_blobs_as_artifacts: True の場合、input blob(例: uploaded file)を debugging と auditing 用の run artifact として保存します。
custom_metadata: invocation に添付される任意 metadata の dict[str, Any] です。tracing や logging に便利です。

API reference¶

field、type、default の完全な一覧は、各言語の API reference を参照してください。