embodied-computation-group
diff --git a/docs/source/examples/R/Example scripts/Example_analysis_bayesian.Rmd b/docs/source/examples/R/Example scripts/Example_analysis_bayesian.Rmd
Lines changed: 69 additions & 11 deletions
diff --git a/docs/source/examples/R/Example scripts/Example_analysis_simple.Rmd b/docs/source/examples/R/Example scripts/Example_analysis_simple.Rmd
Lines changed: 6 additions & 6 deletions
diff --git a/docs/source/examples/R/src/Lapse Cummulative normal b/docs/source/examples/R/src/Lapse Cummulative normal
2.57 MB
diff --git a/docs/source/examples/R/src/Lapse Cummulative normal.stan b/docs/source/examples/R/src/Lapse Cummulative normal.stan
Lines changed: 34 additions & 0 deletions
diff --git a/docs/source/examples/R/src/Standard Cummulative normal b/docs/source/examples/R/src/Standard Cummulative normal
2.55 MB
diff --git a/docs/source/examples/R/src/Standard Cummulative normal.stan b/docs/source/examples/R/src/Standard Cummulative normal.stan
Lines changed: 24 additions & 0 deletions
diff --git a/docs/source/examples/R/src/first_model b/docs/source/examples/R/src/first_model
-2.05 MB
diff --git a/docs/source/examples/R/src/first_model.stan b/docs/source/examples/R/src/first_model.stan
Lines changed: 0 additions & 19 deletions
diff --git a/docs/source/examples/R/src/firstlevelanalysis.R b/docs/source/examples/R/src/firstlevelanalysis.R
Lines changed: 55 additions & 48 deletions
@@ -9,8 +9,12 @@ output: html_document
 # **Here we show how to perform a Bayesian analysis on the data after it has been collected, which is very similar to the "simple" analysis. Please see the "simple" analysis before this!**
 
 ```{r message=FALSE}
-pacman::p_load(tidyverse,ggdist,psycho,caret,patchwork, gt, cowplot, grid,reticulate,cmdstanr,posterior,rstan,bayesplot,here,rmarkdown)
+pacman::p_load(tidyverse,ggdist,psycho,caret,patchwork, gt, cowplot,
+               grid,reticulate,cmdstanr,posterior,rstan,bayesplot,here,rmarkdown,pracma,
+               brms)
 np <- import("numpy")
+
+set.seed(111)
 ```
 
 
@@ -28,12 +32,67 @@ df = psychophysics_df %>% filter(Subject == "sub_0042")
 source(here("docs","source","examples","R","src","firstlevelanalysis.R"))
 ```
 
-The only difference here is that we set bayesian equal to T (TRUE) and specify the model. Here the model is a predefined Stan model that is inside the src folder called first_model.stan
+The only difference here is that we set bayesian equal to T (TRUE) and specify the model. The models can be found in the .stan files inside the src folder. These are probabilistic models written in Stan that are compiled and then used for the sampling. There are currently two options for re-fitting: the standard cumulative normal, and a cumulative normal that incorporates a lapse rate, which specifies the minimum and maximum of the tails of the psychometric function. A lapse rate of 5% (0.05) means that the psychometric function approaches 5% at the lower end and 95% at the upper end. The reason to include a lapse rate is that attentional slips or misclicks at the far ends of the stimulus spectrum (high or low) would otherwise greatly influence the slope of the psychometric function.
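+
+As a minimal sketch of the lapse-rate parameterisation described above, the chunk below writes the psychometric function directly from the formula used in the Stan model, with base R's `pnorm` standing in for the cumulative normal. The name `psychometric_sketch` is hypothetical and chosen so that it does not clash with the `psychometric` function sourced from firstlevelanalysis.R, whose exact definition is not shown here.
+
+```{r}
+# Hypothetical helper: a cumulative normal squeezed between lambda and 1 - lambda.
+# pnorm(x, alpha, beta) equals 0.5 + 0.5 * erf((x - alpha) / (beta * sqrt(2))).
+psychometric_sketch <- function(x, alpha, beta, lambda = 0) {
+  lambda + (1 - 2 * lambda) * pnorm(x, mean = alpha, sd = beta)
+}
+
+# With a 5% lapse rate the tails approach 0.05 and 0.95, and the midpoint stays at 0.5
+psychometric_sketch(c(-80, 0, 80), alpha = 0, beta = 10, lambda = 0.05)
+```
+
+Setting `lambda = 0` in this sketch recovers the standard cumulative normal without a lapse rate.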
+
+The priors of the Bayesian model are given below in the unconstrained space (beta is constrained to be positive, so it is exponentially transformed; the lapse rate is constrained between 0 and 0.5, meaning it is inv_logit transformed and then divided by 2):
+
+alpha ~ normal(0,20);
+beta ~ normal(0,3);
+lambda ~ normal(-4,2);
+
+This means that the parameters in the constrained space look like this:
+```{r}
+data.frame(alpha = rnorm(1000,0,20), beta = exp(rnorm(1000,0,3)), lambda = brms::inv_logit_scaled(rnorm(1000,-4,2)) / 2) %>% 
+  pivot_longer(everything(), values_to = "value",names_to = "parameter") %>% 
+  ggplot(aes(x = value, fill = parameter))+geom_histogram(col = "black")+facet_wrap(~parameter, scales = "free")+theme_classic()
+```
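+
+A rough numerical companion to the histograms, assuming only the priors and transformations listed above: because `exp` and the inverse logit are monotone, quantiles on the unconstrained scale map directly onto the constrained scale. Base R's `plogis` is used here as the inverse logit in place of `brms::inv_logit_scaled`, and the name `prior_quantiles` is only illustrative.
+
+```{r}
+# Median and central 95% interval of each prior, mapped to the constrained scale
+probs <- c(0.025, 0.5, 0.975)
+
+prior_quantiles <- data.frame(
+  quantile = probs,
+  alpha    = qnorm(probs, 0, 20),             # alpha is not transformed
+  beta     = exp(qnorm(probs, 0, 3)),         # beta = exp(beta_unconstrained)
+  lambda   = plogis(qnorm(probs, -4, 2)) / 2  # lambda = inv_logit(lambda_unconstrained) / 2
+)
+prior_quantiles
+```
+
+The prior median of beta is exp(0) = 1, and the prior median of lambda is plogis(-4) / 2, which is a little under 1%.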
+
+Below is a visualization of what this extra lapse rate does, as well as what the priors of the model mean for the psychometric function itself:
+
+```{r}
+n_sim = 25
+
+alpha = rnorm(n_sim,0,20)
+beta = rnorm(n_sim,0,3)
+lambda = rnorm(n_sim,-4,2)
+
+data.frame(alpha = alpha, beta = exp(beta), lambda = brms::inv_logit_scaled(lambda) / 2) %>% 
+  rowwise() %>% 
+  mutate(x = list(seq(-80,80,0.1)),
+         y = list(psychometric(seq(-80,80,0.1), alpha, beta, lambda))
+         ) %>% 
+  ungroup %>% 
+  mutate(id = 1:n()) %>% 
+  unnest(cols = c(x, y)) %>% mutate(lapse = T) %>% 
+  ggplot(aes(x = x, y = y, group = id))+
+  geom_line(alpha = 0.5)+theme_classic()+ggtitle("With Lapse rate")
+
+
+
+data.frame(alpha = alpha, beta = exp(beta), lambda = NA) %>% 
+  rowwise() %>% 
+  mutate(x = list(seq(-80,80,0.1)),
+         y = list(psychometric_nolapse(seq(-80,80,0.1), alpha, beta))
+         ) %>% 
+  ungroup %>% 
+  mutate(id = 1:n()) %>% 
+  unnest(cols = c(x, y)) %>% mutate(lapse = F) %>% 
+  ggplot(aes(x = x, y = y, group = id))+
+  geom_line(alpha = 0.5)+theme_classic()+ggtitle("Without Lapse rate")
+
+
+```
+
+If you want to change the priors of the Bayesian model, this has to be done inside the Stan scripts: open the .stan file and change the last few lines of the model block, where the syntax is the same as above. It is therefore possible to first visualize the prior distributions for the parameters here in the markdown script, see what they entail for the shape of the psychometric function (prior predictive checks), and then change them inside the Stan scripts themselves.
+
 
-**Doing the same as for the simple analysis with bayesian = T**
+**Running the analysis with the Bayesian fit works the same way as the simple analysis, with two additional arguments: the flag bayesian needs to be set to T (TRUE), and a model has to be specified. There are currently two models to choose from, one with the lapse rate and one without.**
 
 ```{r message=FALSE, results='hide',warning=FALSE}
-model = cmdstan_model(here("docs","source","examples","R","src","first_model.stan"))
+# No lapse rate model:
+model = cmdstan_model(here("docs","source","examples","R","src","Standard Cummulative normal.stan"))
+# Lapse rate model (this second assignment overwrites the first; keep only the model you want to fit):
+model = cmdstan_model(here("docs","source","examples","R","src","Lapse Cummulative normal.stan"))
 
 results = single_sub_analysis(df, 
                               interoPost = NA, 
@@ -43,25 +102,24 @@ results = single_sub_analysis(df,
                               out = here::here("docs","source","examples","R"))
 ```
 
-**The results list now also contains a new index called bayesian_plot. This is a list of either 1 or 3 plots. There'll be 1 if you only have one Morality and 3 if you have two (Extero and Intero). Here there is 3 plots**
+**The results list now also contains a new index called bayesian_plot. This is a list of either 1 or 2 plots. There will be 1 if you only have one Modality and 2 if you have two (Extero and Intero). Here there are 2 plots.**
 
 Let's look at them individually:
 
 ```{r}
 results$bayesian_plot[[1]]
 ```
 
-**NOTE: The Import thing to look at for good model convergence is the upper plots: Here we see that all the 4 chains (to the left) seem to capture the same posterior distribution. It is also clear from the trace-plots to the upper right that the chains mix well (hairy catterpillars), meaning good convergence**
+**NOTE: The important thing to look at for good model convergence is the upper plots. Here we see that all 4 chains (to the left) seem to capture the same posterior distribution. It is also clear from the trace plots to the upper right that the chains mix well (hairy caterpillars), indicating good convergence. Lastly, it is essential to check whether there were divergences in the sampling process. These are stored in the stats data frame under divergences; if this column is not 0, trusting the estimates is not advised, even with good-looking chains. Dealing with divergences for single-subject fits like this one involves changing the priors and/or the model itself (i.e. leaving out or including the lapse rate).**
 
 ```{r}
 results$bayesian_plot[[2]]
 ```
-
-
-And the combined plot can be found in the last index
-```{r, fig.height=8,fig.width=14}
-results$bayesian_plot[[3]]
+## **Here is the mean number of divergences in both conditions:**
+```{r}
+results$stats$divergences
 ```
+This indicates that there are divergences here, so running without the lapse rate or changing the priors would perhaps be preferable.
 
 
 **Of course, this can be run across several subjects, like the "simple" analysis**
 
@@ -59,12 +59,12 @@ out is the output directory for the results of the analysis
 
 
 ```{r message=FALSE, results='hide',warning=FALSE}
-results = single_sub_analysis(df,                                                                  #The raw dataframe
-                              interoPost = interoPost,                                             #numpy array for the intero (NA if not avaliable)
-                              exteroPost = exteroPost,                                             #numpy array for the extero (NA if not avaliable)
-                              bayesian = F,                                                        #Bayesian Analysis (TRUE/FALSE)
-                              model = NA,                                                          #Bayesian model here a stan script (NA if Bayesian is FALSE)
-                              out = here::here("docs","source","examples","R"))                    #Output directory for results      
+results = single_sub_analysis(df,                                                         #The raw dataframe
+                              interoPost = interoPost,                                    #numpy array for the intero (NA if not available)
+                              exteroPost = exteroPost,                                    #numpy array for the extero (NA if not available)
+                              bayesian = F,                                               #Bayesian Analysis (TRUE/FALSE)
+                              model = NA,                                                 #Bayesian model here a stan script (NA if Bayesian is FALSE)
+                              out = here::here("docs","source","examples","R"))           #Output directory for results      
 ```
 
 **Note that these analyses can also be run with only one "Modality"; the important thing is that either interoPost or exteroPost then gets set to NA, i.e. the modality you do not have access to!**
 
@@ -0,0 +1,34 @@
+data {
+  int<lower=0> N;
+  array[N] int<lower=0> n;
+  array[N] int <lower=0> y;
+  vector[N] x;
+}
+parameters {
+  real alpha;
+  real beta_unconstrained;
+  real lambda_unconstrained;
+}
+
+
+transformed parameters{
+  
+  real<lower=0> beta = exp(beta_unconstrained);
+  real<lower=0,upper=0.5> lambda = inv_logit(lambda_unconstrained) / 2;
+  
+  
+}
+
+
+model {
+  
+  for (i in 1:N){
+        y[i] ~ binomial(n[i], lambda  + (1 - 2 * lambda) * (0.5+0.5*erf((x[i]-alpha)/(beta*sqrt(2)))));
+  }
+  
+  alpha ~ normal(0,20);
+  beta_unconstrained ~ normal(0,3);
+  lambda_unconstrained ~ normal(-4,2);
+  
+    
+}
@@ -0,0 +1,24 @@
+data {
+  int<lower=0> N;
+  array[N] int<lower=0> n;
+  array[N] int <lower=0> y;
+  vector[N] x;
+}
+parameters {
+  real alpha;
+  real beta_unconstrained;
+}
+
+transformed parameters{
+  
+  real <lower=0> beta = exp(beta_unconstrained);
+}
+
+model {
+  for (i in 1:N){
+    y[i] ~ binomial(n[i], 0.5+0.5*erf((x[i]-alpha)/(beta*sqrt(2))));
+  }
+  alpha ~ normal(0, 20);
+  beta_unconstrained ~ normal(0,3);
+    
+}
@@ -153,67 +153,74 @@ single_sub_analysis <- function(df, interoPost = NA, exteroPost = NA, bayesian =
   # if the bayesian analysis is selected:
   if (bayesian == TRUE) {
     if (n_mod == 2) {
+      
+        # run bayesian analysis on Extero and Intero and append the statistics to the dataframe (resultsdata)
+        results = run_bayes_analysis(df1, model)
+        
+        # Combine stuff for stats and plots
+        stats <- rbind(results[["Intero"]][["stats"]],results[["Extero"]][["stats"]])
+        
+        resultsdata <- cbind(resultsdata, stats)
+        
+        baysplot_ex <- results[["Extero"]][["chainplot"]] + results[["Extero"]][["traceplot"]] + results[["Extero"]][["bayseplot"]] +
+          plot_layout(design = c(
+          patchwork::area(1, 1, 1, 1),
+          patchwork::area(1, 2, 1, 2),
+          patchwork::area(2, 1, 3, 2)
+        ))
+        
+        baysplot_in <- results[["Intero"]][["chainplot"]] + results[["Intero"]][["traceplot"]] + results[["Intero"]][["bayseplot"]] +
+          plot_layout(design = c(
+          patchwork::area(1, 1, 1, 1),
+          patchwork::area(1, 2, 1, 2),
+          patchwork::area(2, 1, 3, 2)
+        ))
+        
+        # save the figures
+        ggsave(paste0(output_dir,"/resultplot_bayse_intero",idx,".png"), baysplot_in, width = 4000, height = 2200, units = "px")
+        ggsave(paste0(output_dir,"/resultplot_bayse_extero",idx,".png"), baysplot_ex, width = 4000, height = 2200, units = "px")
+        
+        bayesplot <- list(baysplot_ex, baysplot_in)
+    }
+    
+    
+    if (n_mod == 1) {
+      
+      modality = as.character(unique(df$Modality))  # character, so results[[modality]] is looked up by name
       # run bayesian analysis on the single available modality and append the statistics to the dataframe (resultsdata)
-      baysextero <- baysiananalysis(df, "Extero", model)
-      stats <- baysextero[[4]]
+      results = run_bayes_analysis(df1, model)
 
-      baysintero <- baysiananalysis(df, "Intero", model)
-      stats <- rbind(stats, baysintero[[4]])
+      stats <- rbind(results[[modality]][["stats"]])
 
       resultsdata <- cbind(resultsdata, stats)
 
-      baysplot_ex <- baysextero[[1]] + baysextero[[2]] + baysextero[[3]] + plot_layout(design = c(
-        patchwork::area(1, 1, 1, 1),
-        patchwork::area(1, 2, 1, 2),
-        patchwork::area(2, 1, 3, 2)
-      ))
-      
-      baysplot_in <- baysintero[[1]] + baysintero[[2]] + baysintero[[3]] + plot_layout(design = c(
-        patchwork::area(1, 1, 1, 1),
-        patchwork::area(1, 2, 1, 2),
-        patchwork::area(2, 1, 3, 2)
-      ))
-      
-      baysplot <- baysextero[[1]] + baysextero[[2]] + baysextero[[3]] + baysintero[[1]] + baysintero[[2]] + baysintero[[3]] + plot_layout(design = c(
-        patchwork::area(1, 1, 1, 1),
-        patchwork::area(1, 2, 1, 2),
-        patchwork::area(2, 1, 3, 2),
-        patchwork::area(1, 3, 1, 3),
-        patchwork::area(1, 4, 1, 4),
-        patchwork::area(2, 3, 3, 4)
-      ))
+      bayesplot <- results[[modality]][["chainplot"]] + results[[modality]][["traceplot"]] + results[[modality]][["bayseplot"]] +
+        plot_layout(design = c(
+          patchwork::area(1, 1, 1, 1),
+          patchwork::area(1, 2, 1, 2),
+          patchwork::area(2, 1, 3, 2)
+        ))
 
       # save the figures
-      ggsave(paste0(output_dir,"/resultplot_bayse_intero",idx,".png"), baysplot_in, width = 4000, height = 2200, units = "px")
-      ggsave(paste0(output_dir,"/resultplot_bayse_extero",idx,".png"), baysplot_ex, width = 4000, height = 2200, units = "px")
-      ggsave(paste0(output_dir,"/resultplot_bayse",idx,".png"), baysplot, width = 4000, height = 2200, units = "px")
+      ggsave(paste0(output_dir,paste0("/resultplot_bayse_",modality),idx,".png"), bayesplot, width = 4000, height = 2200, units = "px")
 
-      baysplot <- list(baysplot_ex, baysplot_in, baysplot)
+      bayesplot <- list(bayesplot)
     }
 
-    if (n_mod == 1) {
-      bayse <- baysiananalysis(df, as.character(unique(df$Modality)), model)
-      stats <- bayse[[4]]
-      resultsdata <- cbind(resultsdata, stats)
+
+      # delete all duplicate columns in the resulting dataframe
+      
+      resultsdata <- resultsdata[, !duplicated(colnames(resultsdata))]
+      # give it sensible rownames:
+      rownames(resultsdata) <- 1:nrow(resultsdata)
+      # save it
+      write.csv(resultsdata, paste0(output_dir,"/data",idx,".csv"))
+      
+      return(list(rt_plot = reactiontimeplot, summary_stat = stat, conf_plot = confidenceplot,AUC_plot = AUC_plot, staircase_plot = intervalplot, histogram_plot = intensityplot, analysis_plot = analysisplot, concatenated_plot = plot, stats = resultsdata, bayesian_plot = bayesplot))
 
-      baysplot <- bayse[[1]] + bayse[[2]] + bayse[[3]] + plot_layout(design = c(
-        patchwork::area(1, 1, 1, 1),
-        patchwork::area(1, 2, 1, 2),
-        patchwork::area(2, 1, 3, 2)
-      ))
-      ggsave(paste0(output_dir,"/resultplot_bayse",idx,".png"), baysplot, width = 4000, height = 2200, units = "px")
-    }
-    
-    # delete all duplicate columns in the resulting dataframe
-    
-    resultsdata <- resultsdata[, !duplicated(colnames(resultsdata))]
-    # give it sensisble rownames:
-    rownames(resultsdata) <- 1:nrow(resultsdata)
-    # save it
-    write.csv(resultsdata, paste0(output_dir,"/data",idx,".csv"))
 
-    return(list(rt_plot = reactiontimeplot, summary_stat = stat, conf_plot = confidenceplot,AUC_plot = AUC_plot, staircase_plot = intervalplot, histogram_plot = intensityplot, analysis_plot = analysisplot, concatenated_plot = plot, stats = resultsdata, bayesian_plot = baysplot))
   }
+  
   # delete all duplicate columns in the resulting dataframe
   resultsdata <- resultsdata[, !duplicated(colnames(resultsdata))]
   # give it sensible rownames: