c775803ab4f2e94e.tex
1: \begin{abstract}
2: Motivated by the empirical power law of the distributions of credits (e.g., the number of ``likes'') of viral posts in social media, we introduce the high-dimensional tail index regression and methods of estimation and inference for its parameters.
3: We propose a regularized estimator, establish its consistency, and derive its convergence rate.
4: To conduct inference, we propose to debias the regularized estimate, and establish the asymptotic normality of the debiased estimator.
5: Simulation studies support our theory.
6: These methods are applied to text analyses of viral posts in X (formerly Twitter) concerning LGBTQ$+$.
7: 
8: {\small {\ \ \ \newline
9: \textbf{Keywords: } high-dimensional data, social media, tail index, text analysis.} \newline }
10: 
11: \end{abstract}
12: