@@ -5,4 +5,13 @@ We first evaluated the effect of dimensions (i.e., number of bases) on model acc

*[Figure 1](figs/hcp_bases_comparison.png). shows the boxplots of model accuracy (correlation) across 100 HCP subjects at four different dimensions (from left to right panel, 15, 25, 50, 100) for each set of bases respectively (ICA: green; PCA: orange; Laplacian Eigenmap: blue). Overall the choice of bases and dimensions has minor effects on model accuracy in predicting task activation maps.

*[Figure 2](figs/ukb_bases_comparison.png). shows the model accuracy in predicting task activation on 100 UKB subjects at dimension 25 (left panel), 100 (middle) and 200 (right panel) for ICA (green), PCA (orange) and Laplacian Eigenmaps (blue). Increases in bases dimension tend to decrease model accuracy possibly due to overfitting. Meanwhile, ICA dual regression maps outperforms the other two approaches in reconstructing the three contrast maps. It is also interesting to note that the model accuracy boxplots across subjects has smaller quartile range at higher dimensions (which is also true on HCP data, although not as obvious).

*[Figure 2](figs/ukb_bases_comparison.png). shows the model accuracy in predicting task activation on 100 UKB subjects at dimension 25 (left panel), 100 (middle) and 200 (right panel) for ICA (green), PCA (orange) and Laplacian Eigenmaps (blue). Increases in bases dimension tend to decrease model accuracy possibly due to overfitting. Meanwhile, ICA dual regression maps outperforms the other two approaches in reconstructing the three contrast maps. It is also interesting to note that the model accuracy boxplots across subjects has smaller quartile range at higher dimensions (which is also true on HCP data, although not as obvious)

### 2. Effects of averaging coefficients of best matched subjects.

We next matched the bases of each subject to the rest in the pool (HCP: 967 subjects; UKB: 1529 subjects) to find a subset of 100 ``other'' subjects that best resemble the chosen one. Each subject pair has a measure of ``matchness'', which is the average of spatial correlations of the matched bases. To reconstruct task activation maps for a new subject, we took weighted average of the coefficients of the 100 best-matched subjects, benchmarked against 100 randomly chosen subjects in the pool and another 100 subjects that provide the worst match of the bases.

* Results are shown in [Figure 3](figs/ukb_match_subjects.png)(UKB subjects) and [Figure 4](figs/hcp_match_subjects.png)(HCP subjects). As expected, reconstruction based on 100 best-matched subjects has highest model accuracy than using 100 unmatched subjects, suggesting a careful selection of subjects can further boost model accuracy in predicting how a new subject respond to tasks (green: 100 best-matched subjects; orange: 100 randomly-chosen subjects; blue: 100 least-matched subjects).