Futuristic Data Portfolio

Data Analyst.
Machine Learning.
Intelligent Analytics Systems.

Transforming complex data into intelligent, scalable systems through analytics engineering, machine learning, generative AI, and real-time visualization.

35K+: Daily records processed in reporting workflows
30%+: Reporting discrepancies reduced through validation
40%: Research throughput improvement via automation
4+: High-impact AI and analytics projects showcased

Analytics Profile: Data + AI + Systems (Active)
Signal Growth | Live View
Pipelines: Robust
Models: ML / AI
Dashboards: Realtime

About

A data portfolio built to feel like a high-end analytics platform.

This portfolio covers more than dashboards. It combines analytics engineering, machine learning, synthetic data research, and production-style tooling into one cohesive technical identity.

Profile Narrative

Sri Satya Harsha Pola is a data analytics and machine learning professional with a Master’s in Data Science and hands-on experience across operational analytics, generative AI research, synthetic data modeling, and visualization-driven decision support.

His work spans Python, SQL, Spark, cloud platforms, interactive dashboards, and research-backed AI systems, which lets him work across both exploratory analysis and scalable analytics implementation.

The result is a profile that feels stronger than a traditional data analyst title alone: part data engineer, part analyst, part ML practitioner, and highly effective at turning raw datasets into usable intelligence.

Master’s in Data Science

Earned at the University of West Florida, with coursework across ML, AI, deep learning, big data, and regression modeling.

Industry + Research Blend

Combines business-facing dashboard work with research in synthetic data, computer vision, and applied AI systems.

Technical Range

Comfortable across Python, R, SQL, Spark, Power BI, Tableau, Docker, cloud platforms, and ML frameworks.

Capabilities

Core strengths across analytics, ML, and scalable data workflows.

The portfolio layout uses premium cards to present his profile as a full analytics stack contributor rather than limiting him to one narrow lane.

Data Engineering & Pipelines

Building robust data flows with Python, SQL, ETL design, APIs, and scalable processing patterns for analytics-ready systems.

Python and SQL-based pipeline development
REST API ingestion and multi-source integration
Data profiling, cleansing, and validation

Machine Learning & Generative AI

Applying ML, deep learning, and synthetic data techniques to solve analytics, prediction, and privacy-preserving modeling challenges.

XGBoost, LSTM, TensorFlow, Keras
GANs, VAEs, SDV, Gaussian Copula
Predictive modeling and experimental evaluation

Visualization & Decision Support

Designing dashboards and visual analytics experiences that help stakeholders understand trends, KPIs, and operational risks in real time.

Power BI, Tableau, Plotly, Dash
Interactive KPI and monitoring dashboards
Storytelling through visual data products

Cloud, Big Data & Deployment

Working across AWS, GCP, Azure ML, Spark, Docker, and workflow tooling to move analytics systems from experimentation to usable products.

AWS, BigQuery, Azure ML
Apache Spark and Airflow workflows
Dockerized analytics deployments

Experience

Industry execution backed by strong research depth.

This section combines operational analytics experience with academic research work to show both practical delivery and advanced technical credibility.

Jan 2025 – Jun 2025
Data Analyst
Titanium Wireless
Engineered data pipelines using Python, SQL, and REST APIs for faster operational insights.
Built interactive dashboards across Python, Tableau, and Salesforce Analytics Cloud.
Supported reporting at scale with Apex-based monitoring of 35,000+ daily records.
Reduced reporting discrepancies by over 30% through profiling, cleansing, and validation.

Feb 2023 – Jun 2025
Research Assistant
University of West Florida
Led research in Generative AI, synthetic data modeling, and real-time fitness monitoring.
Built end-to-end pipelines with Python, R, TensorFlow, and SQL to improve research throughput by 40%.
Applied GANs, VAEs, Gaussian Copula, and SDV for high-fidelity synthetic datasets.
Co-authored publications and supported NSF-aligned proposal and manuscript development.

Dec 2021 – Jun 2022
.NET Intern
Wipro Limited
Supported enterprise web application development and debugging in Agile workflows.
Optimized SQL queries and performance reporting for backend analysis.
Collaborated across UI/UX, QA, and backend teams for reliable deployments.

Projects

Advanced data and AI projects presented like flagship analytics products.

These cards are designed to feel like premium product modules, emphasizing technical depth, measurable outcomes, and futuristic data storytelling.

Featured Project 01

Trade Insights Engine

Python • SQL • Spark • AWS • Flask • Tableau

Built a big-data trade policy analytics system using Spark, SQL, AWS, and machine learning to predict trade flows, logistics efficiency, and supply chain disruptions.

20% improvement in predictive analytics outcomes
Interactive dashboards and Flask APIs for real-time insights
Automated ingestion from external trade data sources

Featured Project 02

Synthetic Data Generation with CTGAN + Gaussian Copula

Python • SDV • CTGAN • Gaussian Copula • Scikit-learn

Designed and evaluated synthetic data generation pipelines for privacy-preserving analytics, validating statistical fidelity and downstream ML utility.

Compared CTGAN and enhanced SDV-based approaches
Supported privacy-preserving data experimentation
Maintained analytical performance and data integrity

Featured Project 03

Supply Chain Optimization Engine

GPT-Neo • SDV • Python • Dash • Machine Learning

Combined synthetic data and machine learning to model shipping delays, supplier reliability, and disruption scenarios in an interactive analytics environment.

22% improvement in forecasting accuracy
Dynamic risk and resilience planning dashboard
Synthetic scenario simulation for supply chain decision support

Featured Project 04

Real-time Fitness Monitoring

Python • MediaPipe • NumPy • Pandas • Computer Vision

Created a real-time computer vision system for motion tracking, posture classification, and exercise feedback using MediaPipe-based pose estimation.

Real-time exercise form detection
Pose-driven feedback loop for better user accuracy
Applied vision pipelines for performance monitoring

Research & Publications

Academic credibility that strengthens the technical brand.

Most analytics portfolios stop at dashboards. This one includes research output and publications, which elevates the profile immediately for technical and advanced analytics roles.

Publication Focus

Synthetic data, AI systems, and applied analytics research

The research section highlights a rare combination of publication-backed work in synthetic data, generative AI, computer vision, and cybersecurity-oriented machine learning. It adds serious depth to the overall portfolio identity.

Integrating Unsupervised and Supervised ML Models for Synthetic Data Analysis from VAE, GAN, and Variable Clustering (2024)

Real-time fitness monitoring with MediaPipe (2024)

Hybrid intelligence for DDoS defense: Combining generative AI, resampling, and ensemble methods (2025)

Generative AI: Comparing CTGAN and CTGAN with Gaussian Copula in Synthetic Data Generation (Conference / In Press)

Skills

A modern analytics stack grouped for clarity and visual polish.

The skills area is structured like a premium tooling dashboard, balancing analyst readability with technical depth.

Languages

Python, R, SQL, Scala, SAS, Apex, C++

Analytics & ML

Scikit-learn, TensorFlow, Keras, Pandas, NumPy, Spark, Airflow

Visualization

Power BI, DAX, Power Query, Tableau, Plotly, Excel, Seaborn

Cloud & Platforms

AWS, BigQuery, Azure ML, Docker, Git, GitHub, Salesforce

Contact

Let’s build analytics that move decisions faster.

Open to data analyst, analytics engineering, business intelligence, and machine learning-oriented opportunities.