Model Configuration
Learn how to configure and customize models for optimal performance in medical AI evaluation. The loader_args that are defined on the ModelMeta can be tweaked
Parameter Tuning
Section titled “Parameter Tuning”Generation Parameters
Section titled “Generation Parameters”Control model behavior with precision:
# Conservative generation for medical accuracykarma eval --model qwen --model-path "Qwen/Qwen3-0.6B" \  --datasets openlifescienceai/pubmedqa \  --model-kwargs '{    "temperature": 0.1,    "top_p": 0.9,    "top_k": 50,    "max_tokens": 512,    "enable_thinking": true,    "seed": 42  }'
# Creative generation for medical educationkarma eval --model qwen --model-path "Qwen/Qwen3-0.6B" \  --datasets medical_education_dataset \  --model-kwargs '{    "temperature": 0.7,    "top_p": 0.95,    "max_tokens": 1024,    "enable_thinking": false  }'Parameter Reference
Section titled “Parameter Reference”| Parameter | Range | Description | Medical Use Case | 
|---|---|---|---|
| temperature | 0.0-1.0 | Randomness control | 0.1-0.3 for diagnostic accuracy | 
| top_p | 0.0-1.0 | Nucleus sampling | 0.9 for balanced responses | 
| top_k | 1-100 | Top-k sampling | 50 for medical terminology | 
| max_tokens | 1-4096 | Output length | 512 for concise answers | 
| enable_thinking | boolean | Reasoning mode | true for complex cases | 
| seed | integer | Reproducibility | Set for consistent results | 
Model-Specific Configuration
Section titled “Model-Specific Configuration”Qwen Models
Section titled “Qwen Models”# Thinking mode for complex medical reasoningkarma eval --model qwen --model-path "Qwen/Qwen3-0.6B" \  --datasets openlifescienceai/pubmedqa \  --model-kwargs '{    "enable_thinking": true,    "thinking_depth": 3,    "temperature": 0.2,    "max_tokens": 512  }'
# Fast inference modekarma eval --model qwen --model-path "Qwen/Qwen3-0.6B" \  --datasets openlifescienceai/pubmedqa \  --model-kwargs '{    "enable_thinking": false,    "temperature": 0.1,    "max_tokens": 256,    "use_cache": true  }'MedGemma Models
Section titled “MedGemma Models”# Medical accuracy optimizationkarma eval --model medgemma --model-path "google/medgemma-4b-it" \  --datasets openlifescienceai/medmcqa \  --model-kwargs '{    "temperature": 0.05,    "top_p": 0.8,    "repetition_penalty": 1.1,    "max_tokens": 400,    "medical_mode": true  }'Audio Models
Section titled “Audio Models”# IndicConformer language-specific configurationkarma eval --model "ai4bharat/indic-conformer-600m-multilingual" \  --model-path "ai4bharat/indic-conformer-600m-multilingual" \  --datasets "ai4bharat/indicvoices_r" \  --model-kwargs '{    "language": "Hindi",    "chunk_length": 30,    "stride": 5,    "batch_size": 1,    "use_lm": true  }'
# Whisper optimizationkarma eval --model openai-whisper \  --datasets medical_audio_dataset \  --model-kwargs '{    "model": "whisper-1",    "language": "en",    "temperature": 0.0,    "condition_on_previous_text": true,    "compression_ratio_threshold": 2.4  }'