2601.13558v1 Jan 20, 2026 cs.AI

ChatGPT 및 기타 자연어 처리 방법을 활용하여 남성 동성애자(MSM)의 위험 및 보호 요인 행동을 식별하는 연구: 소셜 미디어 및 데이팅 앱 텍스트 분석

Leveraging ChatGPT and Other NLP Methods for Identifying Risk and Protective Behaviors in MSM: Social Media and Dating apps Text Analysis

Mehrab Beikzadeh

Citations: 56

h-index: 4

Chenglin Hong

Citations: 509

h-index: 12

C. Cascalheira

Citations: 514

h-index: 13

Callisto Boka

Citations: 8

h-index: 2

Majid Sarrafzadeh

Citations: 8

h-index: 2

Ian W. Holloway

Citations: 74

h-index: 5

남성 동성애자(MSM)는 이성애 남성에 비해 성병 감염 및 과도한 음주 위험이 높습니다. 소셜 미디어 및 데이팅 애플리케이션에서 수집된 텍스트 데이터는 위험 및 보호 요인 행동을 자동으로 식별하여 맞춤형 공공 보건 개입을 위한 새로운 기회를 제공할 수 있습니다. 본 연구에서는 소셜 미디어 및 데이팅 앱의 텍스트가 MSM의 성적 위험 행동, 음주 및 사전 노출 예방 요법(PrEP) 사용 여부를 예측하는 데 사용될 수 있는지 평가했습니다. 참가자의 동의를 얻어 텍스트 데이터를 수집하고, ChatGPT 임베딩, BERT 임베딩, LIWC 및 사전 기반 위험 용어 접근 방식을 사용하여 추출된 특징을 기반으로 머신 러닝 모델을 훈련했습니다. 모델은 월간 폭음 및 5명 이상의 성 파트너를 갖는 행동을 예측하는 데 높은 성능(F1 점수 0.78)을 보였으며, PrEP 사용 및 과음 예측에는 중간 정도의 성능(F1 점수 0.64 및 0.63)을 보였습니다. 이러한 결과는 소셜 미디어 및 데이팅 앱 텍스트 데이터가 위험 및 보호 요인 행동에 대한 귀중한 정보를 제공할 수 있으며, 대규모 언어 모델 기반 방법이 MSM을 위한 확장 가능하고 개인화된 공공 보건 개입을 지원할 수 있는 잠재력을 보여줍니다.

Original Abstract

Men who have sex with men (MSM) are at elevated risk for sexually transmitted infections and harmful drinking compared to heterosexual men. Text data collected from social media and dating applications may provide new opportunities for personalized public health interventions by enabling automatic identification of risk and protective behaviors. In this study, we evaluated whether text from social media and dating apps can be used to predict sexual risk behaviors, alcohol use, and pre-exposure prophylaxis (PrEP) uptake among MSM. With participant consent, we collected textual data and trained machine learning models using features derived from ChatGPT embeddings, BERT embeddings, LIWC, and a dictionary-based risk term approach. The models achieved strong performance in predicting monthly binge drinking and having more than five sexual partners, with F1 scores of 0.78, and moderate performance in predicting PrEP use and heavy drinking, with F1 scores of 0.64 and 0.63. These findings demonstrate that social media and dating app text data can provide valuable insights into risk and protective behaviors and highlight the potential of large language model-based methods to support scalable and personalized public health interventions for MSM.

0 Citations

0 Influential

6.5 Altmetric

32.5 Score

Original PDF

No Analysis Report Yet

This paper hasn't been analyzed by Gemini yet.

댓글을 작성하려면 로그인하세요.

아직 댓글이 없습니다. 첫 번째 댓글을 남겨보세요!