diff --git a/.gitignore b/.gitignore
index 956e3a6..699c1b7 100644
--- a/.gitignore
+++ b/.gitignore
@@ -28,3 +28,4 @@
 /.DS_Store
 *.aux
 *.log
+.DS_Store
diff --git a/2023-02-rl/why-rl-exciting.html b/2023-02-rl/why-rl-exciting.html
index 9eea1c1..266f460 100644
--- a/2023-02-rl/why-rl-exciting.html
+++ b/2023-02-rl/why-rl-exciting.html
@@ -23,8 +23,8 @@

James Brusey

Overview

  • What is Reinforcement Learning?
@@ -38,10 +38,10 @@
What is Reinforcement Learning?

helicopter_tail_rotor_thrust_antitorque_compensation.jpeg

@@ -57,9 +57,9 @@
What is Reinforcement Learning?

RLvsML.jpeg

@@ -90,8 +90,8 @@
Some definitions

  • policy—how an agent behaves
@@ -138,9 +138,9 @@ So let's summarise the key aspects of RL
Example: maze with pitfalls
Example problem: Balance a pole
  • State: pole angle, angular momentum, cart position, velocity
  • Actions: force on cart to left or right
  • Reward: +1 for each time step that the pole is upright
@@ -180,9 +180,9 @@ So let's summarise the key aspects of RL
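To make the state/action/reward framing above concrete, here is a minimal sketch of an agent acting at random in a cart-pole environment. It assumes the third-party gymnasium package and its CartPole-v1 task, neither of which is named in the talk.

import gymnasium as gym

# Assumed environment: gymnasium's CartPole-v1 (an illustration, not from the talk)
env = gym.make("CartPole-v1")
obs, info = env.reset(seed=0)  # obs = [cart position, cart velocity, pole angle, pole angular velocity]

total_reward = 0.0
done = False
while not done:
    action = env.action_space.sample()  # 0 = push cart left, 1 = push cart right
    obs, reward, terminated, truncated, info = env.step(action)
    total_reward += reward              # +1 for every step the pole stays upright
    done = terminated or truncated

print(f"Episode return: {total_reward}")
env.close()

A learned policy would replace the random action choice; everything else in the loop stays the same.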
Example problem: Playing football
  • States: where am I? other players? ball?
  • Actions: turn, run, pass, shoot, tackle
  • Reward: 1 for win, 0 for draw, -1 for loss
@@ -200,14 +200,14 @@ So let's summarise the key aspects of RL
A Brief History of RL
Where does the term "reinforcement" come from?
TOBY (1951) - W. Grey Walter
Bellman equation (1957) and dynamic programming
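For reference, the recursive form referred to here is the Bellman optimality equation; this is the textbook statement rather than anything taken from the slide itself: \[ V^{*}(s) = \max_{a} \left[ r(s,a) + \gamma \sum_{s'} P(s' \mid s,a)\, V^{*}(s') \right] \]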
Barto, Sutton and Anderson: Actor Critic (1983)

figtmp34.png

sutton-head5.jpg

barto_andrew_crop.jpeg

Charles-Anderson.jpg

@@ -296,10 +296,10 @@ So let's summarise the key aspects of RL
Watkins Q-learning (1989)

cw090311.jpg
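The formula truncated in the hunk header below is the start of the standard tabular Q-learning update, \[ Q(s_{t},a_{t}) \leftarrow Q(s_{t},a_{t}) + \alpha \left[ r_{t} + \gamma \max_{a} Q(s_{t+1},a) - Q(s_{t},a_{t}) \right] \]. A minimal Python sketch follows; the environment interface (reset, step, actions) is an assumption for illustration, not code from the talk.

import random
from collections import defaultdict

def q_learning(env, episodes=500, alpha=0.1, gamma=0.99, epsilon=0.1):
    """env is assumed to expose reset() -> state, step(action) -> (state, reward, done), and a list env.actions."""
    Q = defaultdict(float)  # Q[(state, action)], default 0.0
    for _ in range(episodes):
        s = env.reset()
        done = False
        while not done:
            # epsilon-greedy action selection
            if random.random() < epsilon:
                a = random.choice(env.actions)
            else:
                a = max(env.actions, key=lambda act: Q[(s, act)])
            s_next, r, done = env.step(a)
            # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
            best_next = 0.0 if done else max(Q[(s_next, act)] for act in env.actions)
            Q[(s, a)] += alpha * (r + gamma * best_next - Q[(s, a)])
            s = s_next
    return Q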

@@ -322,10 +322,10 @@ Q^{new}(s_{t},a_{t}) \leftarrow \underbrace{Q(s_{t},a_{t})}_{\text{old}} + \unde
Tesauro's TD Gammon (1992)

td-gammon.png

@@ -341,10 +341,10 @@ Q^{new}(s_{t},a_{t}) \leftarrow \underbrace{Q(s_{t},a_{t})}_{\text{old}} + \unde
RL parallels in Neuroscience (1994-)

dopamine.png

@@ -363,10 +363,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
My PhD work - RoboCup

socbot1.png

@@ -378,10 +378,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Move to point (hand coded)

phys-hc-1.png

@@ -395,10 +395,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Move to point (RL)

phys-mcsoft-1.png

@@ -412,10 +412,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Ball dribbling

sym2-0.png

@@ -428,10 +428,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Ball dribbling (hand coded)

t61.2.png

@@ -443,10 +443,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Ball dribbling (RL)

t62.12.png

@@ -459,9 +459,9 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Andrew Ng and Pieter Abbeel's Helicopter (2004)
Atari DQN Google DeepMind (2016) - Start of DeepRL
AlphaGo and AlphaZero (Google DeepMind 2016)

alphago.png

@@ -517,9 +517,9 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Sim to real: Quadruped robots
OpenAI Rubik's cube robot
Learning to walk in 1 hour (Dreamer v3)

Champion level drone racing using Deep RL (Oct 23)

Key challenges for RL for real-world problems

  • Common framework
  • Resolve the environment problem
  • Collect open data
  • Consider the human element
@@ -581,8 +581,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Common framework

  • RL is based on a well-structured problem formulation
@@ -606,8 +606,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Resolve the environment problem

  • Simple environments are easy - results are fast
  • Bugs in the simulator can lead to poor control behaviour
@@ -629,8 +629,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Collect open data

  • Simulating environments from first principles tends to miss key characteristics
@@ -649,9 +649,9 @@ There is a lot of data being collected already but it is not always openly acces
Consider the human element
RL applied to electric vehicle comfort control

car-air-conditioning-service.jpeg

@@ -687,10 +687,10 @@ There is a lot of data being collected already but it is not always openly acces
EV range issue

46-51_Cabin-Conditioning_atrApr19_1.jpeg

@@ -704,10 +704,10 @@ There is a lot of data being collected already but it is not always openly acces
Seat heating

heated-seats-button.jpeg

@@ -723,10 +723,10 @@ There is a lot of data being collected already but it is not always openly acces
Natural ventilation

Coventry_University_Lanchester_Library_6933825422.jpeg

@@ -741,10 +741,10 @@ There is a lot of data being collected already but it is not always openly acces
I've been working on it a while

DSCF0052.jpg

@@ -756,10 +756,10 @@ There is a lot of data being collected already but it is not always openly acces
H2020 EU Project - DOMUS

domus-partners.jpg

@@ -771,10 +771,10 @@ There is a lot of data being collected already but it is not always openly acces
Climate control as an RL problem

comfort-problem.png

@@ -789,8 +789,8 @@ There is a lot of data being collected already but it is not always openly acces
Producing a fast thermal cabin model

  • Let's focus on one aspect - the thermal cabin model
  • Past work suggests that learning a comfort controller requires about 8 years of simulated experience
@@ -806,10 +806,10 @@ There is a lot of data being collected already but it is not always openly acces
Gathering data from the Climatic Wind Tunnel

cwt.png

@@ -823,8 +823,8 @@ There is a lot of data being collected already but it is not always openly acces
Accelerating the cabin model

  • Key idea: it's possible to learn the cabin model from data \[ \mathbf{x}_{t+1} \approx \mathbf{f}_\theta \left( \mathbf{x}_t, \mathbf{u}_t, \mathbf{x}_{t-1},\ldots \right) \]
@@ -847,8 +847,8 @@ where
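One way to read the relation above is as a supervised regression problem over logged trajectories. The sketch below fits a linear one-step model with NumPy least squares; the array names, shapes and the choice of a linear model are illustrative assumptions, not the project's actual code.

import numpy as np

def fit_linear_dynamics(X, U):
    """X: (T, n_states) logged states; U: (T, n_inputs) logged control inputs."""
    inputs = np.hstack([X[:-1], U[:-1]])  # [x_t, u_t] for t = 0..T-2
    targets = X[1:]                       # x_{t+1}
    theta, *_ = np.linalg.lstsq(inputs, targets, rcond=None)  # least-squares fit
    return theta

def predict_next(theta, x_t, u_t):
    return np.hstack([x_t, u_t]) @ theta

# Placeholder data standing in for climatic wind tunnel logs (illustrative only)
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 4))   # e.g. head, torso, foot and mean cabin temperatures
U = rng.normal(size=(1000, 2))   # e.g. blower level and heater power
theta = fit_linear_dynamics(X, U)
x_next = predict_next(theta, X[-1], U[-1])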
Intuition for cabin model

  • Lumped thermal model is based on Newton's law of cooling \[ \frac{dy}{dt} = -k(y-y_0) \]
@@ -871,8 +871,8 @@ where
Intuition for cabin model

  • Therefore, discretising with a forward Euler step: \[ y(t+\Delta t) \approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t = y(t) - k \left( y(t) - y_{0} \right) \Delta t \]
@@ -902,8 +902,8 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
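A quick numerical check of that Euler step, with k, y_0, the step size and the initial temperature all being assumed values for illustration (none come from the slides):

# Forward Euler integration of Newton's law of cooling: dy/dt = -k * (y - y0)
k, y0 = 0.05, 21.0   # assumed cooling rate (1/min) and ambient temperature (deg C)
dt, steps = 1.0, 60  # 1-minute steps over one hour
y = 40.0             # assumed initial cabin temperature (deg C)

for step in range(steps):
    y = y + dt * (-k * (y - y0))  # y(t+dt) ~ y(t) - k*(y(t) - y0)*dt
    if step % 10 == 9:
        print(f"t = {step + 1:2d} min, y = {y:.2f} C")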

Simulator results - driver foot, torso, head

cwt-driver-head-foot.png

@@ -918,8 +918,8 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
Results from this simulator

  • Linear Regression-based model NRMSE 1.8% overall
@@ -943,10 +943,10 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
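For clarity on the metric, a small sketch of NRMSE with the RMSE normalised by the range of the observed values; the slide does not say which normalisation was used, so treat this as one common convention rather than the project's definition.

import numpy as np

def nrmse(y_true, y_pred):
    """Root-mean-square error normalised by the observed range."""
    y_true, y_pred = np.asarray(y_true, float), np.asarray(y_pred, float)
    rmse = np.sqrt(np.mean((y_true - y_pred) ** 2))
    return rmse / (y_true.max() - y_true.min())

# e.g. nrmse(measured_temps, predicted_temps) == 0.018 corresponds to the quoted 1.8%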
Preliminary results using RL

energyweight.png

@@ -962,8 +962,8 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
Conclusions

  • RL is a very active and exciting domain
  • Surprisingly, it has made few inroads into real-world systems
@@ -982,8 +982,8 @@ Focus on optimality
Thank you

Questions?

diff --git a/2023-02-rl/why-rl-exciting.org b/2023-02-rl/why-rl-exciting.org
index 1504316..f084c96 100644
--- a/2023-02-rl/why-rl-exciting.org
+++ b/2023-02-rl/why-rl-exciting.org
@@ -22,7 +22,7 @@
 ** What is Reinforcement Learning?
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + How would you control a helicopter to perform this stunt?
 #+END_NOTES
@@ -58,7 +58,7 @@
 ** Example: maze with pitfalls
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + example problem - simple enough to derive a solution
@@ -72,7 +72,7 @@
 ** Example problem: Balance a pole
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 - State: pole angle, angular momentum, cart position, velocity
 - Actions: force on cart to left or right
 - Reward: +1 for each time step that the pole is upright
@@ -81,7 +81,7 @@
 #+END_NOTES
 ** Example problem: Playing football
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 - States: where am I? other players? ball?
 - Actions: turn, run, pass, shoot, tackle
 - Reward: 1 for win, 0 for draw, -1 for loss
@@ -92,13 +92,13 @@
 ** A Brief History of RL
 *** Where does the term "reinforcement" come from?
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + Pavlov introduced conditioning which says that experience of rewards /reinforces/ that action happening in the same situation next time
 #+END_NOTES
 *** TOBY (1951) - W. Grey Walter
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + By 1950, cybernetics theorised that behaviour was driven by /simple/ rules
 #+END_NOTES
 *** Bellman equation (1957) and dynamic programming
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + Recursive form became theoretical basis for RL
 #+END_NOTES
@@ -219,7 +219,7 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
 #+END_NOTES
 *** Andrew Ng and Pieter Abbeel's Helicopter (2004)
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + Key for this talk is how they learnt each stunt in /simulation/
@@ -235,7 +235,7 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
 #+END_NOTES
 *** Atari DQN Google DeepMind (2016) - Start of DeepRL
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + prior to this - full access to internal state
 + this RL agent just sees pixel values
@@ -254,13 +254,13 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
 #+END_NOTES
 *** Sim to real: Quadruped robots
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + small problems in simulator lead to problems with real world performance
 + however potential for simulator issues to be overcome
 #+END_NOTES
 *** OpenAI Rubik's cube robot
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + training in simulation starts with deterministic simulation
@@ -318,7 +318,7 @@
 There is a lot of data being collected already but it is not always openly accessible
 #+END_NOTES
 *** Consider the human element
-#+REVEAL_HTML:
+#+REVEAL_HTML:
 #+BEGIN_NOTES
 + simple rules yield complex behaviour
 + we shouldn't ignore this problem just because it is hard