dissertation_data.Rd
This dataset, curated specifically for training models in my dissertation, encompasses variables from the YRBS 2017, 2019, and 2021 datasets. Additionally, several variables have been specifically constructed by combining common variables; for example, 'RiskDrivingBehaviors' is a composite variable derived from 'SeatBeltUse', 'TextingDriving', 'DrivingDrinking', and 'DrinkingDriver'. The selection of variables is strategically focused on analyzing suicidality Rigorous curation ensured that each included variable has less than 80% missing data. The selection prioritizes variables that are pertinent to the analysis of suicidality. For details on the exclusion criteria and a list of removed variables, refer to 'inst/scripts/creatingCompleteDataset.qmd'. NOTE: Missing data for SuicideAttempts
#' This dataset exclusively comprises questions sourced from Appendix A of the YRBS 2017, 2019, and 2021, specifically focusing on the QN dichotomous variables. For comprehensive information, please refer to the documentation provided in the 2017, 2019, and 2021 National YRBS Data Users Guide. In this dataset, each variable follows a binary pattern: a value of 1 indicates the presence of the specified behavior, while a value of 0 signifies its absence. For instance, with regard to Suicide Attempts, a value of 1 signifies that adolescents responded affirmatively to having made suicide attempts, whereas 0 indicates a negative response.
Variables in the dataset include:
Survey year (e.g., 2017, 2019, 2021).
Weight variable for survey adjustment.
Participant's sex (Male/Female).
Participant's age.
School grade (9th–12th).
Sexual orientation of the participant.
Carrying a weapon on school property.
Carrying a gun.
Feeling unsafe at school.
Being threatened at school.
Having been forced into sexual intercourse.
Experiencing physical dating violence.
Being bullied.
Being cyberbullied.
Feeling hopeless for 2+ weeks.
Considering suicide.
Making a suicide plan.
Attempting suicide.
Currently smoking cigarettes.
Ever tried vaping.
Currently vaping.
Currently using smokeless tobacco.
Currently smoking cigars.
Attempted to quit tobacco use.
Had first drink of alcohol before age 13.
Currently drinking alcohol.
Currently binge drinking.
Ever used marijuana.
Used marijuana before age 13.
Currently using marijuana.
Ever used synthetic marijuana.
Misusing prescription pain medicine.
Ever injected illegal drugs.
Offered drugs on school property.
Ever had sexual intercourse.
Had sexual intercourse before age 13.
Had 4 or more sexual partners.
Currently sexually active.
Perceived as very overweight.
Does not drink soda.
Does not drink milk.
Does not eat breakfast.
Spends 3+ hours watching TV.
Spends 3+ hours on video games.
Attends physical education classes.
Experienced a concussion during sports.
Gets 8+ hours of sleep.
Does not consume sports drinks.
Does not drink water.
Has difficulty concentrating.
Race/ethnicity of the participant.
Composite variable for hard drug use.
Composite variable for sports and physical activities.
Composite variable for risky driving behaviors.
Composite variable for sexual abuse.
Composite variable for trying cigarettes.
Composite variable for not eating fruits or vegetables.
Involvement in physical fights.
data(dissertation_data)
A tibble with 36779 rows and 58 variables:
Dataset with Suicide Related Predictors Using the 2017, 2019 and 2021 YRBSS data