Cathodoluminescence (CL) data: ------------------------------ Each dataset is saved in a proprietary file format, consisting of 4 separate files: .clm Text header file with pixel dimensions, step size etc .clw List of wavelengths corresponding to the spectral channels. Given in nm, in binary form using single-precision (4-byte) floating point values. .cld The bulk of the data, given a spectrum at a time, starting at the top left pixel and moving right first. In other words, to access the data in order using nested loops, the inner loop would loop over wavelength, the middle one over columns and the outer one over rows. Stored in binary form as 16-bit unsigned integers. .clb This is the CCD background spectrum, which needs to be subtracted from each of the spectra in the .cld file. Stored as single-precision (4-byte) floating point values. Electron backscatter diffraction (EBSD) data: --------------------------------------------- The EBSD data was aquired with a commercial EBSD system (Oxford Instruments Nordlys) and software (Oxford Instruments AZtec). Unprocessed electron backscatter patterns are saved as individuals .tiff files together a pattern of the static background. High resolution EBSD (HR-EBSD) analysis was carried out using the commercial CrossCourt 4 software. Raman data: ----------- Raman spectroscopy measurements were performed using a Renishaw Invia spectrometer and a 488~nm solid-state laser and Renishaw Wire software (.wdf file).