Embed a neural policy inside your simulation, record it on an AADC tape, and train in seconds. 100-400x faster than reinforcement learning, with all sensitivities from the same backward pass.
You operate a system that evolves over time. At each step you make a decision: inject or withdraw gas, charge or discharge a battery, adjust a drug dosage, allocate between asset classes, release water from a dam, steer a robot arm. Prices move, weather changes, patients respond differently. You need the policy that maximizes (or minimizes) an objective across thousands of possible futures.
- **Dynamic programming:** exponential in state dimensions. 3 coupled assets = 150^3 = 3.4M grid states; 10 assets is impossible.
- **Reinforcement learning:** requires millions of episodes and hours of training, and still produces noisy policies with no sensitivities.
- **Finite differences (bump-and-revalue):** need N+1 simulations to get N sensitivities. 365 daily deltas = 366 full simulations; 1,000 robot control variables = 1,001 simulations.
- **Heuristics:** leave money on the table, with no way to know how much.
We differentiate through the entire simulation in one backward pass and get the exact gradient of the objective with respect to every parameter simultaneously.
A small neural policy sits inside a Monte Carlo simulation. The entire computation is recorded on an AADC tape. One forward pass simulates thousands of scenarios; one backward pass computes exact gradients of the objective with respect to every policy weight and every market/model parameter.
The pattern is the same across every domain. Only the physics changes.
   Record once                       Replay millions of times
┌─────────────────────┐          ┌──────────────────────────────┐
│ Physics model       │          │ Forward pass (price/value)   │
│ + Neural policy     │  ───►    │ + Backward pass (all grads)  │
│ + Constraints       │  AADC    │ + All sensitivities: FREE    │
│ + Objective         │  tape    │ AVX2/512 vectorized          │
└─────────────────────┘          │ Multi-threaded               │
                                 └──────────────────────────────┘
Hard limits (tank full, pipe max pressure, SOC bounds) are replaced with calibrated smooth sigmoids so gradients can flow through every constraint.
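As a concrete (if simplified) example of such a surrogate, here is a softplus-based smooth clamp in NumPy; `smooth_clip`, the sharpness `k`, and the softplus construction are our illustrative choices, not the product's calibrated constraint library:

```python
import numpy as np

def softplus(z, k=50.0):
    # Numerically stable log(1 + exp(k*z)) / k via logaddexp.
    return np.logaddexp(0.0, k * z) / k

def smooth_clip(x, lo, hi, k=50.0):
    """Differentiable surrogate for the hard clamp max(lo, min(x, hi)).

    The hard clamp has zero gradient whenever a constraint saturates, which
    stops learning. This version's derivative, sigmoid(k*(x - lo)) -
    sigmoid(k*(x - hi)), stays strictly positive everywhere; larger k hugs
    the hard limits more tightly.
    """
    return lo + softplus(x - lo, k) - softplus(x - hi, k)

# A tank level constrained to [0, 100] still passes gradient information to
# the controller even when the raw command overshoots either bound.
raw = np.array([-20.0, 50.0, 130.0])
print(smooth_clip(raw, 0.0, 100.0))   # ~[0.0, 50.0, 100.0]
```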
The decision network lives inside the AADC tape, not outside it. d(objective) / d(all policy weights) in one pass.
One backward pass computes gradients with respect to all inputs. Whether you have 77 or 2,237 weights, the cost is the same.
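To make "record once, replay many times, one backward pass" concrete, here is a deliberately tiny tape in plain Python. It is a caricature for intuition only, not AADC's API (AADC records compiled AVX kernels, not a Python op list):

```python
import numpy as np

class Tape:
    """Toy operation tape: record once, replay with new inputs, and get the
    gradient w.r.t. every input from a single backward sweep."""
    def __init__(self, n_inputs):
        self.n_inputs = n_inputs
        self.n = n_inputs          # total value slots allocated so far
        self.ops = []              # recorded primitives: (kind, out, a, b)

    def _new(self, kind, a, b):
        out = self.n
        self.n += 1
        self.ops.append((kind, out, a, b))
        return out

    def add(self, a, b): return self._new("add", a, b)
    def mul(self, a, b): return self._new("mul", a, b)

    def forward(self, x):
        v = np.zeros(self.n)
        v[: self.n_inputs] = x
        for kind, out, a, b in self.ops:          # replay recorded ops
            v[out] = v[a] + v[b] if kind == "add" else v[a] * v[b]
        return v

    def backward(self, v, out):
        g = np.zeros(self.n)
        g[out] = 1.0                              # seed d(out)/d(out) = 1
        for kind, o, a, b in reversed(self.ops):  # one reverse sweep
            if kind == "add":
                g[a] += g[o]; g[b] += g[o]
            else:
                g[a] += g[o] * v[b]; g[b] += g[o] * v[a]
        return g[: self.n_inputs]   # gradient w.r.t. every input, one pass

# Record y = (x0 + x1) * x2 once...
t = Tape(3)
y = t.mul(t.add(0, 1), 2)
# ...then replay with fresh inputs and get all three gradients at once.
v = t.forward(np.array([1.0, 2.0, 4.0]))
print(v[y], t.backward(v, y))       # 12.0 [4. 4. 3.]
```

The backward sweep visits each recorded operation exactly once, which is why the cost of getting every input gradient is comparable to one forward replay, no matter how many inputs there are.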
Every application follows the same three steps:

1. **Record** (60 ms to 4 sec, once). Simulate one forward pass on the AADC tape. Physics, neural policy, constraints, objective: all recorded.
2. **Train** (1.4 to 95 seconds). Run forward + backward through the recorded tape; an Adam optimizer updates the policy weights using exact gradients. Repeat 500-2,000 times. A toy version of this loop is sketched below.
3. **Evaluate** (microseconds per evaluation). Replay the tape with new market/sensor data. All sensitivities come free from the backward pass. AVX-vectorized, multi-threaded.

You write the physics model (50-500 lines of domain-specific code). We provide the tape recording, gradient computation, optimizer, smooth constraint library, and validation framework.
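Step 2 in skeleton form. This is a self-contained sketch: the toy quadratic `tape_forward_backward` stands in for replaying the real recorded simulation, and every name here is illustrative rather than part of any API:

```python
import numpy as np

# Stand-in for replaying a recorded tape: returns the objective and its exact
# gradient w.r.t. the policy weights from one forward+backward pass. Here it
# is a toy least-squares objective over 1,024 Monte Carlo paths.
def tape_forward_backward(w, scenarios):
    residual = scenarios @ w - 1.0                  # per-path shortfall (toy)
    obj = 0.5 * np.mean(residual ** 2)              # objective to minimize
    grad = scenarios.T @ residual / len(scenarios)  # exact gradient, one pass
    return obj, grad

def adam_train(w, scenarios, steps=2000, lr=1e-2, b1=0.9, b2=0.999, eps=1e-8):
    m, v = np.zeros_like(w), np.zeros_like(w)
    for t in range(1, steps + 1):
        obj, g = tape_forward_backward(w, scenarios)  # replay the tape
        m = b1 * m + (1 - b1) * g                     # Adam moment updates
        v = b2 * v + (1 - b2) * g * g
        w = w - lr * (m / (1 - b1**t)) / (np.sqrt(v / (1 - b2**t)) + eps)
    return w

rng = np.random.default_rng(0)
paths = rng.normal(size=(1024, 8))       # 1,024 scenarios, 8 policy inputs
w_star = adam_train(np.zeros(8), paths)
```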
Robotics note: the pattern is the same, but the physics comes from an existing engine (MuJoCo) rather than custom code.
Daily injection/withdrawal decisions for natural gas storage. Maximize value under stochastic prices and contractual demand obligations.
The problem with current methods: dynamic programming works for a single facility with one price factor. Add a second facility or demand obligations and the state space explodes.
| Metric | Our approach | Best alternative | Improvement |
|---|---|---|---|
| Training time | 40 seconds | RL (SAC): 14,163 sec | 363x faster |
| Value (with demand obligations) | $3.95M | DP: $2.36M | +67% |
| 365 forward curve deltas | 0 ms (adjoint, free) | DP bump-and-revalue: 5,110 ms | Infinite |
| 3-facility portfolio | 145 sec (linear) | DP: 150^3 = intractable | Tractable |
| Eval vs PyTorch | 0.1 ms / 1,024 paths | 245 ms | 2,500x |
Key insight: Demand obligations add state dimensions. The neural policy sees demand state as another input, so there is no grid explosion.
Architecture: 6 inputs, two hidden layers of 5, 2 outputs = 77 network weights and biases + 730 daily biases = 807 total parameters.
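A quick check of that arithmetic (reading "730 daily biases" as one learned bias per output per day over the 365-day horizon, which is our interpretation):

```python
# 6 -> 5 -> 5 -> 2 fully connected policy: weights plus per-layer biases.
layers = [(6, 5), (5, 5), (5, 2)]
neural = sum(n_in * n_out + n_out for n_in, n_out in layers)  # 35 + 30 + 12
daily = 365 * 2                       # one bias per output per trading day
print(neural, daily, neural + daily)  # -> 77 730 807
```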
| Application | Our approach | RL (PPO/SAC) | Speedup | DP/SDP | Speedup |
|---|---|---|---|---|---|
| Gas storage | 40 sec | 14,163 sec | 363x | N/A (no policy) | -- |
| Hydropower | 29 sec | Failed (-$5.8M) | -- | FD gradient: 2.4 hr | 298x |
| Boiler NOx | 40 sec | 3.7 hours | 328x | N/A | -- |
| Pharma (7-unit) | 7.2 sec | N/A | -- | FD: 5 min | 44x |
| Weather | 1.4 sec | N/A | -- | N/A | -- |
| Insurance ALM | 93 sec | N/A | -- | N/A | -- |
| Climate NGFS matrix (18 cells) | < 1 sec | N/A | -- | Bump-and-revalue: days | Orders of magnitude |
| Battery (single) | 25 sec | Hours (est.) | ~400x | DP: feasible but no Greeks | -- |
| Robot trajectory (200 iter) | 57 ms | N/A | -- | FD: 9,000 ms | 158x |
| Sepsis policy | 9 sec | N/A | -- | N/A | -- |
| Application | Parameters | Adjoint (1 pass) | Finite differences | Speedup |
|---|---|---|---|---|
| Gas storage | 365 daily deltas | 0 ms (free) | 5,110 ms | Free |
| Hydropower | 5 Greeks | 12 ms | 86 ms | 7.2x |
| Insurance ALM | 200 risk factors | 191 ms | 38,356 ms | 200x |
| Pharma | 70 CPPs | 149 ms | 70 bumped runs | ~70x |
| Climate risk (20 factors) | 20 climate factors | 1 adjoint pass | 20 re-simulations | ~20x |
| Robot (Drake, 1,004 vars) | 1,004 controls | 25.2 ms | 760 ms | 30x |
| Sepsis triage | 7 interventions | Included in training | 7 bumped simulations | Free |
| Dimension | DP complexity | Our approach | Result |
|---|---|---|---|
| 1 facility | 150 states | 807 params, 40 sec | DP feasible, we're faster |
| 3 facilities | 150^3 = 3.4M states | 2,456 params, 145 sec | DP intractable |
| 10 reservoirs (coupled) | Exponential | 797 params, 80.5 sec | Linear scaling |
| 30 reservoirs | Impossible | 2,237 params, 244 sec | Still linear |
| 10 batteries (VPP) | Exponential | 202 params (shared), 25 sec | Linear scaling |
| 1,000 batteries (VPP) | Impossible | 202 params (shared), ~30 sec train | Still linear |
Anything where you have a simulation, controls at each time step, constraints, uncertainty, and an objective. The simulation needs to be differentiable or smoothly approximable. The domain doesn't matter much: gas storage, insulin dosing, hydropower, and climate risk analytics (NGFS stress testing, carbon sensitivity) all use the same machinery.
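In symbols, the problem class is (our notation, not anything domain-specific):

$$
\max_{\theta}\; \mathbb{E}_{\xi}\!\left[\sum_{t=0}^{T} r(s_t, a_t, \xi_t)\right]
\quad \text{with} \quad a_t = \pi_{\theta}(s_t), \qquad s_{t+1} = f(s_t, a_t, \xi_t),
$$

where $f$ is the simulation step, $\xi_t$ the stochastic drivers, and hard limits on $s_t$ and $a_t$ are folded into $f$ and $r$ through the smooth surrogates above, so that $\nabla_{\theta}$ of the whole expectation comes out of one backward pass.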
RL estimates gradients from noisy reward signals over millions of episodes. We compute exact gradients in a single backward pass through the simulation. In practice that means 100-400x faster training, deterministic policies, and all sensitivities from the same computation.
DP discretizes the state space into a grid and works backward through time. That scales exponentially with the number of state variables. We parameterize the policy with a neural network, so adding state dimensions adds inputs to the network but doesn't explode the computation. 10 coupled reservoirs train in 80 seconds.
The adjoint pass computes the gradient of the objective with respect to every input in one backward sweep. 5 risk factors or 365 daily price deltas: same cost, roughly that of one forward evaluation.
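The cost asymmetry is easy to demonstrate on a toy objective (NumPy sketch; the quadratic `value` and its hand-written gradient stand in for the real tape and its adjoint pass):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 365                                     # one delta per daily forward price
A = rng.normal(size=(n, n)) / n
F = rng.normal(loc=20.0, scale=2.0, size=n)

def value(F):                               # toy quadratic "storage value"
    return 0.5 * F @ A @ F

# Adjoint route: all 365 deltas from one evaluation-sized pass.
grad_adjoint = 0.5 * (A + A.T) @ F

# Bump-and-revalue route: N + 1 = 366 full revaluations for the same vector.
h, base, I = 1e-6, value(F), np.eye(n)
grad_fd = np.array([(value(F + h * I[i]) - base) / h for i in range(n)])

print(np.max(np.abs(grad_adjoint - grad_fd)))  # agree up to O(h) FD error
```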
We validate every model with a cohort mirror test: the same computation in standard floating-point and in AADC must agree to machine precision (< 1e-12 relative error). Adjoint gradients are checked against finite differences on independent code paths.
Any modern x86 CPU with AVX2. No GPU. All benchmarks on this page ran on a single CPU. Multi-threading across 8+ cores is supported.
We implemented the same gas storage algorithm in PyTorch. It was 16x slower for training and 2,500x slower for evaluation. The difference is AADC's JIT compilation to AVX machine code vs Python interpreter overhead.
Yes. Write your simulation as a templated C++ function or use AADC's idouble type in Python. We handle the recording, optimization, and sensitivity computation.
No. The neural policy, optimizer, and training loop are pre-built. You write the simulation and define the constraints.
PPO captured 65% of the gas storage optimum after 3.7 hours. SAC failed on hydropower (-$5.8M loss). RL estimates gradients from noisy rewards; we compute them exactly. More training time doesn't close that gap.
Isaac Gym uses simplified rigid-body physics on GPU with forward-mode AD. AADC differentiates through the real MuJoCo C code on CPU using reverse-mode AD. Isaac Gym is faster for massively parallel RL training. AADC is better for trajectory optimization, MPC, and system identification where physics fidelity and exact gradients matter more than throughput. Also runs on any CPU, no GPU clusters required.
JAX re-traces the computation graph on each call. AADC records once and replays the compiled kernel indefinitely. For a 50-step MuJoCo trajectory, AADC completes 200 iterations in 57 ms. JAX-based alternatives (DiffMJX, Brax) get roughly 10x over FD. AADC gets 158x. And AADC differentiates through the real MuJoCo C code; JAX alternatives reimplement the physics.
The entire climate-financial chain goes on one AADC tape: carbon price dynamics, transition risk (PD/LGD from carbon costs), physical risk (temperature-dependent damage), and climate-adjusted XVA. One adjoint pass gives d(expected_loss)/d(carbon_price) and d(expected_loss)/d(temperature) simultaneously. The full 6-scenario x 3-horizon NGFS capital matrix computes in under a second. EIOPA compliance deadline is January 2027.
Five so far: ECG cardiac diagnostics (5-class, 0.914 beat-level AUC with exact adjoint interpretability), EEG seizure detection (per-patient channel selection in 1.3 seconds), ICU sepsis triage (12-state ODE with 7 intervention controls), insulin dosing (99.1% time-in-range), and pharma manufacturing (70 ICH Q8 sensitivities in 149 ms). All train on CPU in seconds to minutes.
If you have a sequential decision problem with a simulation behind it, we can probably make it faster. Let's find out.