Product analyst with UX focus. Pioneering comprehensive B2C & B2B Analytics Strategies, proven to enhance core product metrics.
Problems. Context. Business impact. Why damn care?
Problems we face won't change until we handle them properly.● Rising healthcare costs ● Lack of personalized care in the insurance industry, ● Exacerbated by an aging population ● Generic approach to healthcare, create a pressing need for more efficient, patient-centric solutions.
These challenges stem from: ● High costs of advanced medical care.● Increasing chronic illness prevalence.● Systemic inefficiencies, highlighting a gap in personalized healthcare insurance.
Challenge 1. Get reliable data for analysis.
Compliance and privacy rollercoaster. Inevitable evil
Data Privacy and Compliance involving Legal Team, Data Security ExpertsEnsuring compliance with regulations like HIPAA and GDPR. Handling patient data with utmost confidentiality.2 weeks negotiations.
Data Collection and IntegrationGathering data from diverse sources like EHRs, insurance claims, and patient surveys. Integrating these into a cohesive dataset.Involved Backend Engineer, Product managers2 weeks
Data Quality Assurance and ValidationEnsuring the accuracy, completeness, and reliability of the data collected. Dealing with missing values and outliers.Involved Backend Engineer, Product manager, SME from Insurance and healthcare company1,5 weeks
Security ComplianceEnsuring the accuracy, completeness, and reliability of the data collected, validated and securely stored.1 week
> 1 month to get a validated dataset! So lucky to walk through this roller coaster.
Challenge 2. Making sense of data
Compliance and privacy rollercoaster. Inevitable evil
The dataset is a starting point for deeper exploration.
PatientID: Ranges from 1 to 1000, serving as a unique identifier.
Age: Ranges from 18 to 89 years, with an average of approximately 52.7 years.
Gender: Includes three categories, with 'Female' as the most frequent.
ChronicDisease: Two categories ('Yes' and 'No'), with 'Yes' being slightly more frequent.
HospitalVisits: Varies from 0 to 9 visits, averaging about 2.9 visits.
TreatmentCost: Ranges widely from 12 to 77854 units, indicating varied treatment expenses.
MedicationCount: Varies from 0 to 9, with an average of about 4.8 medications per patient.
RecoveryRate: Ranges from 0.29% to 99.76%, with a mean of 50%.
PatientSatisfaction: Ranges from 1 to 5, with an average score of 3.
Basic patterns
Pearson CorrelationTreatment Cost vs Hospital Visits: 0.032Hospital Visits vs Medication Count: 0.031Hospital Visits vs Age: Correlation = 0.027Age vs Medication Count: Correlation = 0.023
Spearman CorrelationHospital Visits vs Age: Correlation = 0.042Medication Count vs Hospital Visits: Correlation = 0.027Age vs Medication Count: Correlation = 0.021Treatment Cost vs Age: Correlation = 0.017
Correlations are pretty weak, however suggest relationships between various healthcare factors such as the frequency of hospital visits, the cost of treatment, the number of medications prescribed, and the age of patients.
Feature Engineering
We can create a new features, for instance, Age that categorizes patients into different age groups (eg, 1 = 0-10 yrs, 2 = 10-20 yrs)This might help in better understanding the age-related trends we'll observe in our analysis.The same related for other features.
Observations
● Most variables exhibit a relatively uniform distribution. ● 'TreatmentCost' shows a skewed distribution, with most values clustered at the lower end.
● Absence of extreme outliers in the data.
● Hospital Visits and Medication Count: A moderate correlation suggests that more hospital visits might correlate with a higher medication count.
● Age Distribution: Appears to be evenly distributed, suggesting a diverse patient demographic.
● Treatment Cost: Skewed towards lower costs, indicating that most treatments are not highly expensive.
● Patient Satisfaction: Shows a broad distribution, which could be influenced by various factors such as treatment cost, recovery rate, etc.
Hypotheses for Further Investigation
● Age and Patient Satisfaction: Is there a significant correlation between age and patient satisfaction? Older patients might have different satisfaction levels due to varied healthcare needs.
● Chronic Disease and Healthcare Costs: Do patients with chronic diseases incur higher treatment costs and hospital visits?
● Recovery Rate and Medication Count: Does a higher medication count lead to a better recovery rate, and how does it impact patient satisfaction?
Data Modeling and Analysis
● Test these hypotheses using statistical methods and machine learning models.
● Use regression analysis to understand the impact of various factors on patient satisfaction and treatment cost.
● Employ classification models to predict patient outcomes based on their profiles.
Action plan
● Begin with feature selection and engineering based on the hypotheses.
● Select and train appropriate models for each hypothesis.
● Evaluate the models and interpret the results to generate insights.
Hypothesis 1 Age and Patient Satisfaction
Linear Regression with Target Variable: Patient SatisfactionFeature: AgeEvaluation Metrics: Mean Squared Error (MSE): 2.04 indicates the average squared difference between the predicted and actual values.
R² Score: -0.007 suggests that the model does not effectively explain the variance in patient satisfaction based on age alone.
Summary
There is no significant linear relationship between age and patient satisfaction based on this model.
This suggests that patient satisfaction might be influenced by factors other than age, or the relationship might be non-linear.
Hypothesis 2 Chronic Disease and Healthcare Costs
Linear Regression with Target Variable: Treatment CostFeature: Chronic DiseaseEvaluation Metrics:
Mean Squared Error (MSE): 1.13R² Score: -0.003 suggests that the model does not effectively explain the variance in patient satisfaction based on age alone.
SummaryThere is no significant linear relationship between the presence of chronic diseases and healthcare costs based on this model.
This indicates that other factors might play a more significant role in determining treatment costs.
Hypothesis 3 Recovery Rate and Medication Count
Linear Regression with Target Variable: Recovery RateFeature: Medication CountEvaluation Metrics: Mean Squared Error (MSE): 0.68R² Score: -0.00048 suggests that the model does not effectively explain the variance in patient satisfaction based on age alone.
SummaryThere is no significant linear relationship between medication count and recovery rate based on this model.
This indicates that the recovery rate might be influenced by a combination of factors, and a linear model using only medication count is not sufficient to predict recovery rates.
Overall Insights and Recommendations:
● The linear regression models used to test the hypotheses indicate that single variables (age, chronic disease presence, medication count) do not significantly explain the variances in patient satisfaction, treatment costs, and recovery rates.
● This suggests the need for more complex models that can account for multiple factors and their interactions.
● Further analysis could involve using more sophisticated machine learning techniques like decision trees, random forests, or neural networks, which can capture non-linear relationships and interactions between multiple variables.
Overall Insights and Recommendations:
● The linear regression models used to test the hypotheses indicate that single variables (age, chronic disease presence, medication count) do not significantly explain the variances in patient satisfaction, treatment costs, and recovery rates.
● This suggests the need for more complex models that can account for multiple factors and their interactions.
● Further analysis could involve using more sophisticated machine learning techniques like decision trees, random forests, or neural networks, which can capture non-linear relationships and interactions between multiple variables.
Official Lenovo. Custom edition for Ukrainian market. 3x Turnover in 1 year.
Lenovo Official Ukraine
PANDORA SPECIAL DESIGN FOR UKRAINE
In Lead Generation and Acquisition, I turn complexities into clear solutions, solving unique challenges.
Smart maintenance SAAS solution for railroad switches.
A new approach in design for asset operators that control assets' health and manage occurred and predicted issues with maintenance team.
SMART MAINTENANCE FOR RAILROAD SWITCHES
Japanese railroads research how to apply smart maintenance solutions
My personal approaches to understand how to apply sensors to decrease operational costs for different railway operators.
RESEARCH ON JAPANESE RAILROADS
GS Diamonds Australian jewelry online store
Now GS Diamonds' usability is far better than it's main world-known competitors like BlueNile and James Allen.
BEAUTY SAVES THE WORLD
Analytical-insightful SAAS platform Reviewscanner
The most up to date insightful platform to understand how to market a product based on user reviews
#SAAS #Retail Analytics #Research
ACTIONABLE INSIGHTS FROM PRODUCT REVIEWS
Plan B Berlin-based Escape rooms challenges.
Completely redesigned whole user flow due to new products like VR games and booking system updates.
GET READY FOR THE NEXT QUEST
Smart photo composition using AI suggestions
The hardest part in the 48 hours challenge to gather the dev team. Idea to increase photographers' productivity, save time and money using AI suggestions.
ML
ML Photography
Predictive photo
85 hours
Out of 103 teams, we won a 3rd place. Filed 2 patents for the technology.
AI PHOTO COMPOSITION HACKATHON
London HackXLR8 Hackathon
The hardest part in the 48 hours challenge to gather the dev team. Idea to increase photographers' productivity, save time and money using AI suggestions. Food photographers' solution to improve composition. We save hours of routine work.
ML
ML Photography
Predictive photo
24 hours
Real-time AI based suggestions for Lighting and well-composed photos.
10 times Faster way to make Fantastic composition.
Food photographers' solution to improve composition. We save hours of routine work.
How it works:
1, We recognize main objects on your scene.
2. AI searches for the best composed images
3. You get real-time food photo suggestions
Presentation
24 hours team coding challenge