{
 "cells": [
  {
   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# High-Resolution Emission Retrievals\n",
    "\n",
    "This tutorial covers how to run a retrieval with high-resolution ground-based emission data using POSEIDON. \n",
    "\n",
    "Before you run this notebook, you should run the [\\\"Ground-Based High-Resolution Emission Spectroscopy (Cross Correlation)\\\"](emission_high_res_cross_correlate.html) tutorial first to preprocess the data. If you have data_processed.hdf5 saved in your planet directory, you are all set!\n",
    "\n",
    "We will reproduce the result from [Brogi and Line 2019](https://ui.adsabs.harvard.edu/abs/2021Natur.598..580L/abstract), validating our framework on WASP-77Ab."
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Loading WASP-77Ab Emission Data\n",
    "\n",
    "First, we will load the processed data for WASP-77 Ab. For more information about this dataset and to learn the basics of high-resolution cross correlation spectroscopy, see the [\\\"Ground-Based High-Resolution Emission Spectroscopy (Cross Correlation)\\\"](emission_high_res_cross_correlate.html) tutorial."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "from POSEIDON.high_res import read_high_res_data\n",
    "\n",
    "planet_name = 'WASP-77Ab'\n",
    "\n",
    "data_dir = '../../../POSEIDON/reference_data/observations/' + planet_name # The directory where you've put the data\n",
    "\n",
    "data = read_high_res_data(data_dir, names=[\"IGRINS\"])  # We named the dataset IGRINS in the previous notebook"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Creating a Retrieval Model\n",
    "\n",
    "Now, let's provide the wavelength grid and properties of the host star and your planet. The wavelength range should match the range of your data, which spans 1.3 microns to 2.6 microns in this case.\n",
    "\n",
    "We use R=250,000 as a tradeoff between computational speed and accuracy. For more discussion, see the previous tutorial."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 1,
   "metadata": {},
   "outputs": [],
   "source": [
    "from POSEIDON.core import define_model, wl_grid_constant_R\n",
    "from POSEIDON.core import create_star, create_planet\n",
    "from POSEIDON.constants import R_Sun, R_J, M_J\n",
    "\n",
    "# ***** Wavelength grid *****#\n",
    "\n",
    "wl_min = 1.3  # Minimum wavelength (um)\n",
    "wl_max = 2.6  # Maximum wavelength (um)\n",
    "R = 250000    # Change the spectral resolution of grid here.\n",
    "\n",
    "# Create a wavelength grid with constant R\n",
    "wl = wl_grid_constant_R(wl_min, wl_max, R)\n",
    "\n",
    "# ***** Define stellar properties *****#\n",
    "\n",
    "R_s = 0.91 * R_Sun  # Stellar radius (m)\n",
    "T_s = 5605.0        # Stellar effective temperature (K)\n",
    "Met_s = -0.04       # Stellar metallicity [log10(Fe/H_star / Fe/H_solar)]\n",
    "log_g_s = 4.48      # Stellar log surface gravity (log10(cm/s^2) by convention)\n",
    "\n",
    "star = create_star(R_s, T_s, log_g_s, Met_s, wl=wl, stellar_grid=\"phoenix\")\n",
    "\n",
    "# ***** Define planet properties *****#\n",
    "\n",
    "planet_name = \"WASP-77Ab\"  # Planet name used for plots, output files etc.\n",
    "\n",
    "R_p = 1.21 * R_J  # Planetary radius (m)\n",
    "M_p = 1.76 * M_J  # Mass of planet (kg)\n",
    "\n",
    "# Create the planet object\n",
    "planet = create_planet(planet_name, R_p, mass=M_p)\n",
    "\n",
    "# If distance not specified, use fiducial value\n",
    "if planet[\"system_distance\"] is None:\n",
    "    planet[\"system_distance\"] = 1  # This value only used for flux ratios, so it cancels\n",
    "d = planet[\"system_distance\"]"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Existing literature have shown detection of $\\rm{H}_2\\rm{O}$, $\\rm{C}\\rm{O}$, $\\rm{N}\\rm{H}_3$, and $\\rm{C}\\rm{H}_4$ in the atmosphere of WASP-77Ab.\n",
    "\n",
    "So for a first attempt, we consider a model with $\\rm{H}_2\\rm{O}$, $\\rm{C}\\rm{O}$, $\\rm{N}\\rm{H}_3$, and $\\rm{C}\\rm{H}_4$, a 5-parameter temperature profile (Madhusudan & Seager 2009), and no clouds.\n",
    "\n",
    "For additional parameters used in high resolution retrieval, we include: $log_\\alpha$ (the scaling parameter), $K_p$ (the Keplerian orbital velocity), $V_{sys}$ (the systematic velocity), and $W_{conv}$ (width of the gaussian convolution kernel used for line broadening). An additional parameter available is $\\Delta \\phi$ (offseting the ephemeris)."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 2,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Free parameters: ['R_p_ref' 'a1' 'a2' 'log_P1' 'log_P2' 'log_P3' 'T_ref' 'log_H2O' 'log_CO'\n",
      " 'log_NH3' 'log_CH4' 'K_p' 'V_sys' 'W_conv' 'log_alpha_HR']\n"
     ]
    }
   ],
   "source": [
    "# ***** Define model *****#\n",
    "\n",
    "model_name = \"Retrieval\"  # Model name used for plots, output files etc.\n",
    "bulk_species = [\"H2\", \"He\"]  # H2 + He comprises the bulk atmosphere\n",
    "\n",
    "param_species = [\"H2O\", \"CO\", \"NH3\", \"CH4\"]\n",
    "\n",
    "model = define_model(model_name, bulk_species, param_species, \n",
    "                     PT_profile = \"Madhu\", reference_parameter = \"R_p_ref\",\n",
    "                     high_res_method = \"sysrem\", # Important! Should be the same as the method used to preprocess the data\n",
    "                     alpha_high_res_option = 'log', \n",
    "                     fix_alpha_high_res = False, fix_W_conv_high_res = False,\n",
    "                     fix_beta_high_res = True, fix_Delta_phi_high_res = True,\n",
    "                     )\n",
    "\n",
    "# Check the free parameters defining this model\n",
    "print(\"Free parameters: \" + str(model[\"param_names\"]))"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Setting Retrieval Priors\n",
    "\n",
    "One of the most important aspects in any Bayesian analysis is deciding what priors to use for the free parameters. Specifying a prior has two steps: (i) choosing the type of probability distribution; and (ii) choosing the allowable range.\n",
    "\n",
    "Most free parameters in atmospheric retrievals with POSEIDON use the following prior types:\n",
    "\n",
    "- Uniform: you provide the minimum and maximum values for the parameter.\n",
    "- Gaussian: you provide the mean and standard deviation for the parameter.\n",
    "\n",
    "<div class=\"alert alert-info\">\n",
    "\n",
    "  **Note:**\n",
    "\n",
    "  If you do not specify a prior type or range for a given parameter, POSEIDON will ascribe a default prior type (generally uniform) and a 'generous' range.\n",
    "\n",
    "</div>\n",
    "\n",
    "\n",
    "This retrieval is defined by 15 free parameters printed above: (1) the radius at the (fixed) reference pressure; (2) the P-T profile parameters; (3) the log-mixing ratios; and (4) the four high resolution parameters.\n",
    "\n",
    "Since we are assuming no *a priori* information on WASP-77Ab's atmosphere, we will use uniform priors for all the parameters."
   ]
  },
  {
   "cell_type": "code",
   "execution_count": 4,
   "metadata": {},
   "outputs": [],
   "source": [
    "from POSEIDON.core import set_priors\n",
    "\n",
    "# ***** Set priors for retrieval *****#\n",
    "\n",
    "# Initialise prior type dictionary\n",
    "prior_types = {}\n",
    "\n",
    "# Specify whether priors are linear, Gaussian, etc.\n",
    "prior_types[\"T_ref\"] = \"uniform\"\n",
    "prior_types[\"R_p_ref\"] = \"uniform\"\n",
    "prior_types[\"log_X\"] = \"uniform\"\n",
    "prior_types[\"a1\"] = \"uniform\"\n",
    "prior_types[\"a2\"] = \"uniform\"\n",
    "prior_types[\"log_P1\"] = \"uniform\"\n",
    "prior_types[\"log_P2\"] = \"uniform\"\n",
    "prior_types[\"log_P3\"] = \"uniform\"\n",
    "prior_types[\"K_p\"] = \"uniform\"\n",
    "prior_types[\"V_sys\"] = \"uniform\"\n",
    "prior_types[\"log_alpha_HR\"] = \"uniform\"\n",
    "prior_types[\"W_conv\"] = \"uniform\"\n",
    "\n",
    "# Initialise prior range dictionary\n",
    "prior_ranges = {}\n",
    "\n",
    "# Specify prior ranges for each free parameter\n",
    "prior_ranges[\"T_ref\"] = [500, 2000]\n",
    "prior_ranges[\"R_p_ref\"] = [0.5 * R_p, 1.5 * R_p]\n",
    "prior_ranges[\"log_X\"] = [-15, 0]\n",
    "prior_ranges[\"a1\"] = [0, 1]\n",
    "prior_ranges[\"a2\"] = [0, 1]\n",
    "prior_ranges[\"log_P1\"] = [-5, 2]\n",
    "prior_ranges[\"log_P2\"] = [-5, 2]\n",
    "prior_ranges[\"log_P3\"] = [-2, 2]\n",
    "prior_ranges[\"K_p\"] = [150, 250]\n",
    "prior_ranges[\"V_sys\"] = [-50, 50]\n",
    "prior_ranges[\"log_alpha_HR\"] = [-2, 2]\n",
    "prior_ranges[\"W_conv\"] = [0, 50]\n",
    "\n",
    "# Create prior object for retrieval\n",
    "priors = set_priors(planet, star, model, data, prior_types, prior_ranges)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Pre-load Opacities\n",
    "\n",
    "The last step before running a retrieval is to pre-interpolate the cross sections for our model and store them in memory. For more details on this process, see the forward model tutorial.\n",
    "\n",
    "<div class=\"alert alert-warning\">\n",
    "\n",
    "  **Warning:**\n",
    "\n",
    "  Ensure the range of $T_{\\rm{fine}}$ used for opacity pre-interpolation is at least as large as the desired prior range for temperatures to be explored in the retrieval. Any models with layer temperatures falling outside the range of $T_{\\rm{fine}}$ will be automatically rejected (for retrievals with non-isothermal P-T profiles, this prevents unphysical profiles with negative temperatures etc.)\n",
    "\n",
    "</div>"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "Reading in cross sections in opacity sampling mode...\n",
      "H2-H2 done\n",
      "H2-He done\n",
      "H2O done\n",
      "CO done\n",
      "Opacity pre-interpolation complete.\n"
     ]
    }
   ],
   "source": [
    "from POSEIDON.core import read_opacities\n",
    "import numpy as np\n",
    "\n",
    "# ***** Read opacity data *****#\n",
    "opacity_treatment = \"opacity_sampling\"\n",
    "\n",
    "# Define fine temperature grid (K)\n",
    "T_fine_min = 400  # 400 K lower limit suffices for a typical hot Jupiter\n",
    "T_fine_max = 4000  # 2000 K upper limit suffices for a typical hot Jupiter\n",
    "T_fine_step = 50  # 20 K steps are a good tradeoff between accuracy and RAM\n",
    "\n",
    "T_fine = np.arange(T_fine_min, (T_fine_max + T_fine_step), T_fine_step)\n",
    "\n",
    "# Define fine pressure grid (log10(P/bar))\n",
    "log_P_fine_min = -5.0  # 1 ubar is the lowest pressure in the opacity database\n",
    "log_P_fine_max = 2  # 100 bar is the highest pressure in the opacity database\n",
    "log_P_fine_step = 0.2  # 0.2 dex steps are a good tradeoff between accuracy and RAM\n",
    "\n",
    "log_P_fine = np.arange(log_P_fine_min, (log_P_fine_max + log_P_fine_step), log_P_fine_step)\n",
    "\n",
    "opac = read_opacities(model, wl, opacity_treatment, T_fine, log_P_fine)"
   ]
  },
  {
   "attachments": {},
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Run Retrieval\n",
    "\n",
    "You are now ready to run your high resolution atmospheric retrieval on this dataset!\n",
    "\n",
    "<div class=\"alert alert-info\">\n",
    "\n",
    "  **Tip:**\n",
    "\n",
    "  Retrievals run faster on multiple cores. When running the cells in this Jupyter notebook, only a single core will be used. You can run a multi-core retrieval on 24 cores by converting this Jupyter notebook into a python script, then calling mpirun on the .py file:\n",
    "\n",
    "  ```\n",
    "  mpirun -n 24 python -u YOUR_RETRIEVAL_SCRIPT.py\n",
    "  ```\n",
    "  \n",
    "</div>\n",
    "\n",
    "\n",
    "<div class=\"alert alert-info\">\n",
    "\n",
    "  **Important Note:**\n",
    "  A high resolution forward model is computationally expensive (~1 second per model). With 400 live points, it took > $10^6$ evaluations for the model to converge. This retrieval could be finished with ~8 hours on 24 cores.\n",
    "\n",
    "  Instead of waiting until the end of time for the next cell to finish, you could run the 'emission_high_res_retrieval.py' file in this folder, which is the same code converted from this notebook, and parallelise with multiple cores in command line. \n",
    "  \n",
    "  To check the code is working before launching a high-res retrieval, you can run the cell below and wait for a couple of minutes. Once it says \"live points generated\" and still no error, you are good to run it on multiple cores!\n",
    "  \n",
    "</div>"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [
    {
     "name": "stdout",
     "output_type": "stream",
     "text": [
      "POSEIDON now running 'Retrieval'\n",
      " *****************************************************\n",
      " MultiNest v3.10\n",
      " Copyright Farhan Feroz & Mike Hobson\n",
      " Release Jul 2015\n",
      "\n",
      " no. of live points =  400\n",
      " dimensionality =   13\n",
      " *****************************************************\n",
      " Starting MultiNest\n",
      " generating live points\n",
      " live points generated, starting sampling\n",
      "Acceptance Rate:                        0.993377\n",
      "Replacements:                                450\n",
      "Total Samples:                               453\n",
      "Nested Sampling ln(Z):            8309213.271610\n",
      "Acceptance Rate:                        0.976562\n",
      "Replacements:                                500\n",
      "Total Samples:                               512\n",
      "Nested Sampling ln(Z):            8309219.101900\n",
      "Acceptance Rate:                        0.929054\n",
      "Replacements:                                550\n",
      "Total Samples:                               592\n",
      "Nested Sampling ln(Z):            8309220.009087\n"
     ]
    }
   ],
   "source": [
    "from POSEIDON.retrieval import run_retrieval\n",
    "\n",
    "# ***** Specify fixed atmospheric settings for retrieval *****#\n",
    "\n",
    "# Atmospheric pressure grid\n",
    "P_min = 1e-5     # 10 ubar\n",
    "P_max = 100     # 100 bar\n",
    "N_layers = 100  # 100 layers\n",
    "\n",
    "# Let's space the layers uniformly in log-pressure\n",
    "P = np.logspace(np.log10(P_max), np.log10(P_min), N_layers)\n",
    "\n",
    "# Specify the reference pressure and radius\n",
    "P_ref = 1e-2  # Reference pressure (bar)\n",
    "\n",
    "# ***** Run atmospheric retrieval *****#\n",
    "\n",
    "run_retrieval(planet, star, model, opac, data, priors, wl, P, P_ref, R_p_ref = R_p,\n",
    "              R = R, spectrum_type = \"emission\", sampling_algorithm = \"MultiNest\",\n",
    "              N_live = 400, verbose = True, N_output_samples = 1000, \n",
    "              resume = False)"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "## Plot Retrieval Results"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Generate a corner plot after the retrieval is finished\n",
    "from POSEIDON.corner import generate_cornerplot\n",
    "\n",
    "fig_corner = generate_cornerplot(planet, model)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "metadata": {},
   "outputs": [],
   "source": [
    "# Read retrieved PT profile and plot it\n",
    "from POSEIDON.utility import read_retrieved_PT\n",
    "from POSEIDON.visuals import plot_PT_retrieved\n",
    "\n",
    "# Read the retrieved PT profile\n",
    "P, T_low2, T_low1, T_median, \\\n",
    "T_high1, T_high2 = read_retrieved_PT(planet_name, model_name)\n",
    "\n",
    "\n",
    "PT_median = [(T_median, P)]\n",
    "PT_low2 = [(T_low2, P)]\n",
    "PT_low1 = [(T_low1, P)]\n",
    "PT_high1 = [(T_high1, P)]\n",
    "PT_high2 = [(T_high2, P)]\n",
    "\n",
    "# Plot the retrieved PT profile\n",
    "plot_PT_retrieved(planet_name, PT_median, PT_low2, PT_low1, PT_high1, PT_high2,\n",
    "                  # T_true=None, # Uncomment this line if you have a PT profile to compare to\n",
    "                  # # colour_list=[], # Uncomment this line if you want to specify colors\n",
    "                  T_min=2000, T_max=4000,\n",
    "                  legend_location=\"lower left\"\n",
    "                  )"
   ]
  },
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "Below is the corner plot and retrieved PT profile from a retrieval on this dataset.\n",
    "\n",
    "![title](../../_static/notebook_images/high_res_emis_corner.png)"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "POSEIDON_python_3.11",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.11.9"
  },
  "orig_nbformat": 4
 },
 "nbformat": 4,
 "nbformat_minor": 2
}