Scaling transformers to high-dimensional sparse data: a Reformer-BERT approach for large-scale classification. — SciRadar