Gabriel Kinshuk

Machine Learning Engineering & Data Science

I build rigorous, scalable data pipelines and machine learning systems for high-stakes research and business environments. By combining statistical modeling with mathematical optimization, I transform complex data into actionable insights that support strategic decision-making.

My work spans automating research workflows, developing computer vision infrastructure, and applying optimization to solve supply chain challenges. Across every project, I prioritize reproducibility, engineering rigor, and measurable impact.

Professional Experience

Computer Vision Capstone Project

Industry Partner: Google

January 2026 - Present

In Progress

  • Conducting a literature review on action recognition techniques in computer vision and analyzing EHS regulations across different regions.
  • Performing data augmentation on training datasets to improve model robustness and support future experimentation.

Project Goals / Deliverables

  • Prototype a computer vision pipeline to detect EHS-related incidents and automate structured data extraction for reporting.
  • Develop models for action recognition and temporal sequence analysis to identify potential causes and risk factors.
  • Establish modular ML workflows and infrastructure to support iterative experimentation and reproducibility.

Graduate Researcher

Biomedical Engineering Research

2022 - 2025

Designed and implemented automated data orchestration systems for complex research datasets. Built Python-based pipelines emphasizing modularity, reproducibility, and automation.

  • Constructed a data pipeline in MATLAB and Python utilizing machine learning and statistical modeling to determine module compatibility in hybrid transcription regulators.
  • Published two review papers and submitted an experimental paper focusing on predictive modeling for genetic circuits, funded by a government grant.
  • Composed a 70-page research paper and technical documentation in LaTeX, including self-designed figures to illustrate complex biological and statistical models.
  • Automated data processing workflows and implemented contemporary analysis techniques, resulting in a several-fold increase in data throughput.
PythonMATLABData Orchestration & PipelinesStatistical & Predictive ModelingResearch Workflow AutomationShell Scripting (Bash & Batch)

Education

Master of Science in Business Analytics

University of Texas at Austin

In Progress

Advanced Machine Learning & Predictive Modeling

Statistical Analysis & Data Mining

Business Intelligence & Analytics

Master of Science in Biomedical Engineering

University of North Texas

Completed 2025

Computational Research Methods

Data Pipeline Development

Research Automation & Infrastructure

Projects

TileShift

Puzzle game with procedural level generation using BFS state-space search and solvability verification. Features flood-fill algorithms, comprehensive testing, and MVC architecture.

JavaAlgorithmsSoftware TestingSoftware Engineering

NYC Transit Analysis

A Tournament of Models for NYC MTA ridership forecasting. Evaluates ARIMA and Bayesian AR(7) to predict subway, bus, and bridge demand for transit supply chain optimization and capacity planning.

RTime SeriesStatistical ModelingForecastingBayesian Methods

Applied Optimization for Business

Collection of mathematical optimization projects focusing on Linear Programming (LP), Nonlinear Programming (NLP), and integer-based decision modeling.

Mathematical OptimizationOperations ResearchLinear & Integer ProgrammingRisk ManagementGurobi

Multimodal Price Prediction

A multimodal machine learning pipeline to predict ASOS apparel prices by fusing tabular metadata, NLP (BERT), and Computer Vision (ConvNeXt) in a Late-Fusion architecture to capture brand prestige and aesthetic value.

PyTorchComputer VisionNLPMultimodal LearningApplied Machine Learning

Protein Parser

Bioinformatics tool with a GUI built on top of BioPython for parsing and analyzing protein sequence data.

PythonBioinformaticsData ParsingScientific Computing

Austin Restaurant Whitespace

Market analysis identifying underserved Austin restaurant segments by transforming unstructured images into semantic labels. Features a computer vision-to-NLP pipeline using TF-IDF and Cosine Similarity to evaluate aesthetic consistency and market positioning.

PythonMarket AnalysisComputer VisionNLPUnstructured Data AnalyticsAPI IntegrationStatistical Modeling

Technical Skills

Engineering

PythonJavaSQLGitLinux/UnixSoftware ArchitectureDockerTesting & ValidationAlgorithm Design (Graphs, Search, Optimization)Software Development

Machine Learning

PyTorchTensorFlowComputer VisionNatural Language ProcessingMultimodal LearningModel Evaluation & ExperimentationML Pipeline Design

Data Engineering

SnowflakeData Pipelines & AutomationResearch Workflow EngineeringData Parsing & ValidationFeature EngineeringData WarehousingShell Scripting (Bash, Batch)

Analytics & Statistics

Time Series AnalysisBayesian ModelingA/B TestingForecastingMathematical OptimizationRisk Modeling

Recent Activities

Google Computer Vision Capstone

January 2026

Began an industry-sponsored capstone focused on action recognition for EHS incident detection in surveillance video, including structured report automation and roadmap development for temporal modeling.

Trip to the Bay Area

January 2026

Visited San Francisco and San Jose and spent time with friends working in tech.

Trip to India

December 2025

Traveled to Delhi and Haryana for my sister’s wedding and spent time with family.

Roo Hackathon

October 2025

Participated in the Roo Hackathon and collaborated on a rapid prototype under time constraints.

Relocated to Austin for MSBA at UT Austin

July 2025

Moved to Austin to begin the M.S. in Business Analytics at UT Austin, focusing on machine learning, optimization, and applied statistics.

Trip to Birmingham, AL

April 2025

Visited Dotdash Meredith test kitchens and went hiking outside the city.

Master’s Thesis Defense

December 2024

Successfully defended my M.S. thesis on protein structure prediction using statistical modeling over biological sequences.

American Chemical Society Conference

March 2024

Presented research applying statistical methods to protein structure prediction using sequence-based features.