diff --git a/2023-02-rl/why-rl-exciting.html b/2023-02-rl/why-rl-exciting.html
index 0bc0926..2c4e4d9 100644
--- a/2023-02-rl/why-rl-exciting.html
+++ b/2023-02-rl/why-rl-exciting.html
@@ -23,8 +23,8 @@

James Brusey

Overview

  • What is Reinforcement Learning?
@@ -38,10 +38,10 @@
What is Reinforcement Learning?


helicopter_tail_rotor_thrust_antitorque_compensation.jpeg

@@ -57,8 +57,8 @@
What is Reinforcement Learning?


RLvsML.jpeg

@@ -90,8 +90,8 @@
Some definitions

  • policy—how an agent behaves
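An illustrative aside (toy code, not from the talk): concretely, a policy is just a mapping from states to actions, here derived greedily from a made-up value table.

#+BEGIN_SRC python
# "policy" made concrete: a mapping from states to actions.
# Toy value table for a 5-state world; all numbers are invented.
Q = {(s, a): 0.0 for s in range(5) for a in ("left", "right")}
Q[(2, "right")] = 1.0  # suppose we learned that "right" pays off in state 2

def greedy_policy(state):
    """Behave by picking the highest-value action in this state."""
    return max(("left", "right"), key=lambda a: Q[(state, a)])

print(greedy_policy(2))  # -> right
#+END_SRC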
@@ -138,8 +138,8 @@ So let's summarise the key aspects of RL
Example: maze with pitfalls

Example problem: Balance a pole

  • State: pole angle, pole angular velocity, cart position, cart velocity (see the interaction-loop sketch below)
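An illustrative aside (not part of the original slides): the cart-pole task takes a few lines to set up, assuming the Gymnasium package is available. A random policy drops the pole almost immediately, which is exactly the behaviour RL must improve on.

#+BEGIN_SRC python
# Minimal RL interaction loop on cart-pole (sketch; requires gymnasium).
import gymnasium as gym

env = gym.make("CartPole-v1")
obs, info = env.reset(seed=0)
total_reward, done = 0.0, False
while not done:
    action = env.action_space.sample()  # random policy: push left or right
    obs, reward, terminated, truncated, info = env.step(action)
    # obs = [cart position, cart velocity, pole angle, pole angular velocity]
    total_reward += reward
    done = terminated or truncated
print(f"Episode return under a random policy: {total_reward}")
#+END_SRC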
@@ -180,8 +180,8 @@ So let's summarise the key aspects of RL
Example problem: Playing football

  • States: where am I? other players? ball?
@@ -200,13 +200,13 @@ So let's summarise the key aspects of RL
A Brief History of RL
Where does the term "reinforcement" come from?

TOBY (1951) - W. Grey Walter

Bellman equation (1957) and dynamic programming
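An illustrative aside (a toy MDP, not from the talk): the Bellman optimality equation \[ V(s) = \max_a \left[ R(s,a) + \gamma \sum_{s'} P(s'|s,a)\, V(s') \right] \] turns directly into the dynamic-programming method of value iteration.

#+BEGIN_SRC python
# Value iteration on a tiny made-up two-state MDP (illustrative only).
import numpy as np

gamma = 0.9
P = np.array([[[0.8, 0.2], [0.1, 0.9]],   # P[s, a, s'] transition probabilities
              [[0.5, 0.5], [0.0, 1.0]]])
R = np.array([[1.0, 0.0],                 # R[s, a] expected rewards
              [0.0, 2.0]])

V = np.zeros(2)
for _ in range(1000):
    Q = R + gamma * (P @ V)   # Q[s, a] = R[s, a] + gamma * E[V(s')]
    V_new = Q.max(axis=1)     # Bellman optimality backup
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new
print("values:", V, "greedy policy:", Q.argmax(axis=1))
#+END_SRC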

Barto, Sutton and Anderson: Actor Critic (1983)

figtmp34.png

sutton-head5.jpg

barto_andrew_crop.jpeg

Charles-Anderson.jpg

@@ -296,10 +296,10 @@ So let's summarise the key aspects of RL
Watkins Q-learning (1989)


cw090311.jpg

@@ -322,10 +322,10 @@ Q^{new}(s_{t},a_{t}) \leftarrow \underbrace{Q(s_{t},a_{t})}_{\text{old}} + \unde
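The update rule truncated in the hunk context above is the standard tabular Q-learning rule, Q(s,a) ← Q(s,a) + α[r + γ max_a' Q(s',a') − Q(s,a)]. A minimal sketch follows (illustrative, assuming a Gymnasium-style discrete environment; the hyperparameters are placeholders).

#+BEGIN_SRC python
# A minimal tabular Q-learning sketch (illustrative, not from the talk).
import numpy as np

def q_learning_episode(env, Q, alpha=0.1, gamma=0.99, epsilon=0.1,
                       rng=np.random.default_rng()):
    """Run one episode, updating the table Q[s, a] in place."""
    s, _ = env.reset()
    done = False
    while not done:
        # epsilon-greedy exploration
        if rng.random() < epsilon:
            a = env.action_space.sample()
        else:
            a = int(np.argmax(Q[s]))
        s_next, r, terminated, truncated, _ = env.step(a)
        # TD target: no future value once the episode terminates
        target = r + (0.0 if terminated else gamma * np.max(Q[s_next]))
        Q[s, a] += alpha * (target - Q[s, a])   # the Watkins update
        s, done = s_next, terminated or truncated
#+END_SRC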
Tesauro's TD Gammon (1992)


td-gammon.png

@@ -341,10 +341,10 @@ Q^{new}(s_{t},a_{t}) \leftarrow \underbrace{Q(s_{t},a_{t})}_{\text{old}} + \unde
RL parallels in Neuroscience (1994-)


dopamine.png

@@ -363,10 +363,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
My PhD work - RoboCup


socbot1.png

@@ -378,10 +378,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Move to point (hand coded)


phys-hc-1.png

@@ -395,10 +395,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Move to point (RL)


phys-mcsoft-1.png

@@ -412,10 +412,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Ball dribbling


sym2-0.png

@@ -428,10 +428,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Ball dribbling (hand coded)


t61.2.png

@@ -443,10 +443,10 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Ball dribbling (RL)


t62.12.png

@@ -459,8 +459,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Andrew Ng and Pieter Abbeel's Helicopter (2004)

Atari DQN Google DeepMind (2015) - Start of DeepRL

AlphaGo and AlphaZero (Google DeepMind 2016)


alphago.png

@@ -517,8 +517,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Sim to real: Quadruped robots

OpenAI Rubik's cube robot

-Champion level drone racing using Deep RL
+Learning to walk in 1 hour (Dreamer v3)
+Champion level drone racing using Deep RL (Oct 23)
Key challenges for RL for real-world problems

  • Common framework
  • Resolve the environment problem
@@ -562,8 +581,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Common framework

  • RL is based on a well-structured problem formulation
@@ -587,8 +606,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Resolve the environment problem

  • Simple environments are easy - results are fast
  • Bugs in the simulator can lead to poor control behaviour
@@ -610,8 +629,8 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
Collect open data

  • Simulating environments from first principles tends to miss key characteristics
@@ -630,8 +649,8 @@ There is a lot of data being collected already but it is not always openly acces
Consider the human element

RL applied to electric vehicle comfort control


car-air-conditioning-service.jpeg

@@ -668,10 +687,10 @@ There is a lot of data being collected already but it is not always openly acces
EV range issue


46-51_Cabin-Conditioning_atrApr19_1.jpeg

@@ -685,10 +704,10 @@ There is a lot of data being collected already but it is not always openly acces
Seat heating


heated-seats-button.jpeg

@@ -704,10 +723,10 @@ There is a lot of data being collected already but it is not always openly acces
Natural ventilation


Coventry_University_Lanchester_Library_6933825422.jpeg

@@ -722,10 +741,10 @@ There is a lot of data being collected already but it is not always openly acces
I've been working on it a while


DSCF0052.jpg

@@ -737,10 +756,10 @@ There is a lot of data being collected already but it is not always openly acces
H2020 EU Project - DOMUS


domus-partners.jpg

@@ -752,10 +771,10 @@ There is a lot of data being collected already but it is not always openly acces
Climate control as an RL problem


comfort-problem.png

@@ -770,8 +789,8 @@ There is a lot of data being collected already but it is not always openly acces
Producing a fast thermal cabin model

  • Let's focus on one aspect - the thermal cabin model
  • Past work suggests that learning a comfort controller requires about 8 years of simulated experience
@@ -787,10 +806,10 @@ There is a lot of data being collected already but it is not always openly acces
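For scale, a back-of-envelope aside (assuming a 1 Hz control rate, which the slides do not specify): 8 years of experience is roughly a quarter of a billion environment steps, hence the need for a fast surrogate model.

#+BEGIN_SRC python
# Back-of-envelope: 8 years of simulated experience at 1 step per second.
seconds_per_year = 365 * 24 * 3600   # 31,536,000
steps = 8 * seconds_per_year         # assumed 1 Hz control rate
print(f"{steps:,} steps")            # 252,288,000
#+END_SRC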
Gathering data from the Climatic Wind Tunnel


cwt.png

@@ -804,8 +823,8 @@ There is a lot of data being collected already but it is not always openly acces
Accelerating the cabin model

  • Key idea: it's possible to learn the cabin model from data \[ \mathbf{x}_{t+1} \approx \mathbf{f}_\theta \left( \mathbf{x}_t, \mathbf{u}_t, \mathbf{x}_{t-1},\ldots \right) \]
@@ -828,8 +847,8 @@ where
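An illustrative aside (placeholder data and names, not the DOMUS code): a one-step linear model of this form can be fitted from logged data and scored with NRMSE, here taken as RMSE normalised by each signal's range.

#+BEGIN_SRC python
# Fit x_{t+1} ~ f([x_t, u_t]) with linear regression, then report NRMSE.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X_state = rng.normal(size=(1000, 4))  # placeholder logged states x_t
U_ctrl = rng.normal(size=(1000, 2))   # placeholder control inputs u_t
X_next = rng.normal(size=(1000, 4))   # placeholder next states x_{t+1}

features = np.hstack([X_state, U_ctrl])
model = LinearRegression().fit(features, X_next)
pred = model.predict(features)

rmse = np.sqrt(np.mean((pred - X_next) ** 2, axis=0))
nrmse = 100 * rmse / (X_next.max(axis=0) - X_next.min(axis=0))
print("NRMSE per state variable (%):", nrmse)
#+END_SRC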
Intuition for cabin model

  • Lumped thermal model is based on Newton's law of cooling \[ \frac{dy}{dt} = -k(y-y_0) \]
@@ -852,8 +871,8 @@ where
Intuition for cabin model

  • Therefore \[ y(t+\Delta t) \approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \]
@@ -883,8 +902,8 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
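An illustrative aside (parameter values are assumptions, not from the talk): applying this forward-Euler step to Newton's law of cooling gives a one-line update.

#+BEGIN_SRC python
# Forward-Euler step for dy/dt = -k (y - y0); k, y0, dt are illustrative.
def euler_cooling_step(y, k=0.05, y0=22.0, dt=1.0):
    """One discrete update: y(t + dt) ~ y(t) + dy/dt * dt."""
    return y + (-k * (y - y0)) * dt

y = 5.0  # cabin air temperature (deg C) on a cold start, made up
for _ in range(10):
    y = euler_cooling_step(y)
print(f"after 10 steps: {y:.2f} C")  # relaxes toward y0 = 22
#+END_SRC

Note that the update is linear in the state, y(t+\Delta t) = (1 - k\Delta t)\, y(t) + k\Delta t\, y_0, which gives some intuition for why the linear-regression cabin model reported later works as well as it does.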

Simulator results - driver foot, torso, head

cwt-driver-head-foot.png

@@ -899,8 +918,8 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
Results from this simulator

  • Linear Regression-based model NRMSE 1.8% overall
@@ -924,10 +943,10 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
Preliminary results using RL


energyweight.png

@@ -943,8 +962,8 @@ y(t+\Delta t) &\approx y(t) + \frac{\Delta y}{\Delta t}\cdot \Delta t \\
Conclusions

  • RL is a very active and exciting domain
  • Surprisingly, it has made few inroads into real-world systems
@@ -963,8 +982,8 @@ Focus on optimality
Thank you

Questions?

diff --git a/2023-02-rl/why-rl-exciting.org b/2023-02-rl/why-rl-exciting.org
index ef098ee..0de5b4c 100644
--- a/2023-02-rl/why-rl-exciting.org
+++ b/2023-02-rl/why-rl-exciting.org
@@ -265,8 +265,20 @@ The dopamine response coding an error in the prediction of reward (Eq. 1) closel
 + progressively adds more randomisation during learning so that when transferred to real robot, behaviour is more robust
 #+END_NOTES
-*** Champion level drone racing using Deep RL
-#+REVEAL_HTML:
+*** Learning to walk in 1 hour (Dreamer v3)
+#+REVEAL_HTML:
+#+BEGIN_NOTES
++ why isn't the walking behaviour better?
++ look at how well it recovers from being knocked over
+#+END_NOTES
+
+*** Champion level drone racing using Deep RL (Oct 23)
+#+REVEAL_HTML:
+
+#+BEGIN_NOTES
++ This is amazing because it solves the problem of sim2real for a difficult problem
++ There are still a lot of technical challenges here to do with state estimation
+#+END_NOTES

 ** Key challenges for RL for real-world problems
 - Common framework
diff --git a/2024-01-ctpsr-ai/ctpsr-ai.org b/2024-01-ctpsr-ai/ctpsr-ai.org
new file mode 100644
index 0000000..6b71075
--- /dev/null
+++ b/2024-01-ctpsr-ai/ctpsr-ai.org
@@ -0,0 +1,62 @@
+#+title: Rough intro to AI
+#+date: 11 January 2024
+#+property: header-args:ipython :session session1 :results output raw drawer :exports both
+#+options: toc:nil H:1
+#+startup: beamer
+#+latex_class: beamer
+#+latex_class_options:
+#+beamer_theme: Boadilla
+#+latex_header: \usepackage{natbib}
+#+description:
+#+keywords:
+#+subtitle:
+#+latex_compiler: pdflatex
+
+* Some talking points
++ Understanding of the main AI tools currently available and what might change over the next few months
++ Current uses in research
++ Opportunities and risks
++ Ethical considerations
++ CU capabilities and how these are evolving
+
+* What AI thinks of AI
+
+#+attr_latex: :height 0.8\textheight
+[[file:figures/DALL-E 2024-01-11 10.54.12 - An educational illustration showcasing different types of artificial intelligence. The image is divided into several sections, each representing a dif.png]]
+* Overview of AI
++ Machine learning
+  + Supervised
+    + Take a picture and recognise a digit, dog, tank, \ldots
+  + Generative
+  + Unsupervised
+  + Reinforcement Learning
+
+* Importance of Openness for LLMs
++ Technology for OpenAI GPT-4 is proprietary (as are Bard / Claude)
++ Open source systems (e.g., Mistral) getting better though
++ Need to avoid lock-in
++ Need to know what is inside
+
+
+* Similarities between LLMs and the Human Brain
+  - Physical and Functional Differences
+  \note{Acknowledge that the physical hardware and many mechanisms of neural networks are distinct from the human brain.}
+  - Large-Scale Complexity
+  \note{Emphasize how, at a macro level, the complexities and capabilities of Large Language Models (LLMs) can appear remarkably similar to certain functions of the human brain.}
+  - Pattern Recognition and Learning
+  \note{Highlight the similarities in how both LLMs and the human brain learn from vast amounts of data and recognize patterns.}
+  - Limitations in Comparison
+  \note{Caution against overstating the comparison, as the human brain's workings are vastly more complex and less understood.}
+
+* Current Limitations and Future Possibilities of LLMs
+  - Lack of Memory and Contextual Understanding
+  \note{Explain how LLMs, unlike the human brain, do not possess real memory but use context to create an illusion of continuity and understanding.}
+  - Output Restrictions
+  \note{Note that LLMs are currently limited to text output, lacking the ability to perform actions or interact with the environment.}
+  - No Embodiment or Sensory Perception
+  \note{Highlight the absence of a physical or sensory presence in LLMs, limiting their understanding of the real world.}
+  - Absence of Emotional Intelligence
+  \note{Discuss the lack of emotional capacity in LLMs, differentiating them significantly from human cognitive and emotional processes.}
+  - Potential for Future Advancements
+  \note{Speculate on the future evolution of AI, suggesting that current limitations like memory, embodiment, sensory perception, and emotional intelligence might be overcome in the next 20 years, leading to more advanced and human-like AI capabilities.}
diff --git a/2024-01-ctpsr-ai/ctpsr-ai.pdf b/2024-01-ctpsr-ai/ctpsr-ai.pdf
new file mode 100644
index 0000000..ebf84bf
Binary files /dev/null and b/2024-01-ctpsr-ai/ctpsr-ai.pdf differ
diff --git a/2024-01-ctpsr-ai/figures/DALL-E 2024-01-11 10.54.12 - An educational illustration showcasing different types of artificial intelligence. The image is divided into several sections, each representing a dif.png b/2024-01-ctpsr-ai/figures/DALL-E 2024-01-11 10.54.12 - An educational illustration showcasing different types of artificial intelligence. The image is divided into several sections, each representing a dif.png
new file mode 100644
index 0000000..0660fb9
Binary files /dev/null and b/2024-01-ctpsr-ai/figures/DALL-E 2024-01-11 10.54.12 - An educational illustration showcasing different types of artificial intelligence. The image is divided into several sections, each representing a dif.png differ