Define subset name to either 'CGNER' for course-granular dataset or 'FGNER' for fine-granular dataset. In both cases, the NER datasets from the four CONLL files are combined and randomly sampled into train, val and test splits. Moreover, in the case of FG-NER, only the most frequent 27 classes were included, all other classes were annotated as O tags. These training data is located inside `examples/data` folder.