NLP-Enhanced Sumerian-English Translation: Classification and NER Techniques

NLP Python TensorFlow NER

Application of Natural Language Processing techniques to improve translation accuracy of ancient and low-resource languages. Developed classification-translation models for Sumerian-English transliteration and Named Entity Recognition models to bridge lexical gaps from unique Sumerian proper nouns, enhancing translational accuracy and semantic understanding.

Sumerian NER visualization

Natural Language Sentiment Analysis: Custom Vectorization and Tokenization Methodologies

NLP Python ML TF-IDF

Custom TF-IDF vectorization pipeline optimized for sentiment classification with frequency-based feature selection and context-aware n-gram generation (n=1,2,3). Implemented hashmap-based term frequency calculator reducing computational complexity from O(n²) to O(n). Achieved 4.2% accuracy improvement over scikit-learn's baseline on IMDb dataset, particularly excelling at identifying nuanced sentiment expressions and negation patterns.

Technical highlights: Modified IDF weighting with corpus-specific entropy measurements, adaptive stopword filtering, modular architecture for configuration-driven feature extraction without retraining.

XML Data Warehouse for Data Forecasting and Analysis

Data Engineering Java MySQL ETL

Hybrid OLTP/OLAP data warehouse utilizing SAX parsers for efficient XML processing with O(1) memory complexity. Custom metadata-driven ETL pipeline with XPath query optimization reduced extract processing time by 68% compared to DOM parsing. Snowflake schema design with SCD Type 2 for historical tracking, materialized views for aggregation acceleration, and MySQL stored procedures for transactional consistency.

Performance optimizations: Parallel data loading with configurable thread pools, XML schema validation with custom DTD constraints, bitmap indexing on high-cardinality attributes. Achieved sub-second query response for 92% of OLTP operations while supporting complex OLAP aggregations on 3.2TB of historical pharmaceutical sales data.

Unified File System Query Tool with Hierarchical DOM Analysis

Systems Programming C++ Data Structures

Cross-platform file system abstraction tool implementing custom B-tree indexing with O(log n) query performance. Platform-agnostic core translates between POSIX and Win32 filesystem APIs through adapter pattern, enabling unified XPath-inspired query execution across heterogeneous systems.

Key features: Memory-mapped file access for high-throughput operations, lazy-loading directory traversal to minimize memory footprint, custom binary serialization protocol reducing metadata overhead by 78% compared to JSON. Benchmarks demonstrated 3.2x faster recursive search performance with 40% reduced memory consumption.

News Article Classifier with Custom K-Means Clustering Implementation

Machine Learning Python Clustering TF-IDF

Custom K-Means clustering algorithm for text data analysis on the 20 Newsgroups dataset. Utilized TF-IDF vectorization to categorize text documents into clusters, enhancing understanding of content similarities. Benchmarked against scikit-learn KMeans using Homogeneity, Completeness, V-measure, Adjusted Rand Index, and Silhouette Coefficient metrics, focusing on optimizing clustering quality and execution efficiency for complex textual data.

Implementation of Energy and Performance-Aware Task Scheduling in Mobile Cloud Computing

C++ Algorithms Optimization

C++ implementation of the theoretical framework for energy-aware task scheduling in mobile cloud computing. Constructs Directed Acyclic Graphs (DAGs) of tasks with local and cloud execution times. Performs initial task assignment and prioritization to minimize total completion time, then iteratively explores task migrations between local cores and cloud for energy efficiency. Linear-time kernel rescheduling algorithm ensures all precedence constraints and resource capacities remain satisfied while optimizing both makespan and energy consumption.

Reinforcement Learning in Complex Route Navigation and Spatial Decision Making

Reinforcement Learning Python Q-Learning SARSA

Simulation exploring Q-Learning, SARSA, and Actor-Critic algorithms in a complex stochastic grid-based maze environment (248 navigable cells) with dynamic challenges including walls, bumps, and oil slicks. Each algorithm evaluated over 10 independent runs of 1,000 episodes, optimizing paths and maximizing rewards within defined penalty and reward structures.

Reinforcement Learning in Cancer Research: Optimizing the p53-Mdm2 Feedback Loop

Reinforcement Learning Computational Biology Python

Applied reinforcement learning to fine-tune activation of the p53-Mdm2 negative feedback loop, crucial for cell cycle regulation and tumor suppression. Developed optimal control policy to effectively manage key network components (ATM, p53, Wip1, MDM2), maximizing therapeutic efficacy by maintaining an active p53 pathway. This novel approach offers potential for inhibiting tumor growth and enhancing cellular repair mechanisms in cancer therapy.

Bayesian Optimized Neural Networks for Minimum-Error Classification

Deep Learning Python Bayesian Methods

Bayesian-optimized MLP architecture for minimum-error classification of multivariate random vectors with non-Gaussian distributions. Custom regularization combining L2 weight decay with Bayesian priors to approximate posterior probability distributions, achieving 97.2% accuracy. Implemented adaptive learning rate scheduler, conditional batch normalization for non-IID data, and entropy-weighted loss function. Parameterized grid search across 128 configurations identified optimal single hidden layer with 42 neurons, achieving classification error within 0.3% of theoretical Bayes error limit.

API Triggered ETL Pipeline for CSV Processing

Data Engineering Python ETL APIs

ETL pipeline implementation for efficiently processing and transforming CSV files into structured data points and features. Responsive to API calls, the system methodically extracts data from CSV files, applies necessary transformations, and uploads processed data to a relational database. Facilitates dynamic data management, supports data analysis, and provides scope for extended API integration.

Vehicle Localization Optimization in 2D Space via Range Measurements and MAP Estimation

Bayesian Methods MATLAB Optimization

Implementation of Maximum A Posteriori (MAP) estimation to enhance vehicle localization accuracy in 2D environments. Bayesian model integrates range measurements to multiple landmarks with existing positional data, enabling highly accurate estimation of vehicle's most probable location. Effectively addresses noisy distance measurements and prior knowledge, ensuring precise navigation despite environmental complexities and measurement inaccuracies.

Vehicle localization visualization

Snake Game Engine

Web Development Vue.js JavaScript HTML/CSS

Game engine for the classic Snake game developed using Vue.js with vanilla HTML and CSS. Vue.js streamlines state management and facilitates real-time rendering, ensuring dynamic updates are efficiently handled without browser reloading. Implements dynamic management of game state including snake movement, collision detection, and score tracking with clean color schemes, borders, and animations.

Snake game screenshot

Minimax Tic-Tac-Toe

Web Development JavaScript AI HTML/CSS

Game engine for Tic-tac-toe developed using vanilla JavaScript, HTML, and CSS. Implements the Minimax algorithm to simulate intelligent opponent moves and strategize optimal responses. The algorithm enables the system opponent to make strategically sound decisions, mimicking human-like decision-making processes, adding challenge and realism to the game engine.

Tic-tac-toe screenshot

Sound Processor and Song Note Compiler

C C++ Audio Processing

Sound processing and song note compilation platform for real-time audio signal processing and synthesis. Enables generation of various waveforms (Sine, Square, Triangle, Sawtooth) through user-adjustable parameters. Supports complex audio signal layering with detailed amplitude control and modulation. Advanced digital filters including reverb create short-lived echoes to simulate different acoustic environments. Song player functionality decodes various audio formats, arranging decoded samples into seamless playback sequence for sound design or educational purposes.

Purple Bug Mobile Companion App

Android Java Kotlin Mobile Development

Summer 2020 Internship Project
Android companion app to interface with UV light hardware device component of Purple-Bug UV ride-share cleaning product. Developed using Java and Kotlin with stable and responsive UI. Network communications implemented using Retrofit for effective data transmission between mobile app and backend services. Image loading and caching optimized using Glide. Local data storage implemented using Room (abstraction layer over SQLite). Complex asynchronous tasks managed using RxJava and Kotlin Coroutines for smooth operation.