Portfolio Website
I am a graduate student (M.S. in computer science) at Oregon State University. Striving to contribute clean code and robust software engineering capabilities, with a keen interest in opportunities within machine learning, computer vision, or data science.
🌟 Looking for Full-Time Opportunities 🌟
Degree | University | Duration |
---|---|---|
M.S., Computer Science | Oregon State University | Sep 2022 - present |
B.Tech., Electronics and Communication | PES University | June 2022 |
Graduate Research Assistant @ Oregon State University (Sep 2022 - Present)
Currently working on Sim to real experiments and point cloud based object detection (3D Vision). Working on TR3D (Point cloud 3D object detection) for custom data (for all rotations across the axes).
- Research focused on capturing inter‑object and object‑environment interactions at long ranges, exploring 3D and point cloud versions.
- Leveraged the Region Proposal Interaction Network to enhance model performance, yielding remarkable results on our custom MCS DARPA dataset
- Used Motion Indeterminacy diffusion model for diverse trajectory prediction for intuitive physics experiments.
Robotics intern @ Nokia Bell Labs (NJ) (Jun 2023 - Aug 2023)
Research intern @ Indian Institute of Science (Jul 2021 - Dec 2021)
Student intern @ Nokia (Feb 2021 - Feb 2022)
A novel motion planning framework for general locomoting systems, beyond car-like robots, integrating dynamically feasible motion primitives using Deep Deterministic Policy Gradients (DDPG) reinforcement learning and an artificial potential field for accurate learning guidance.
This project aims to build an explainable CNN model to classify Mars HiRISE images, providing insights into why the model makes certain classifications. By achieving interpretability, it will enhance trust in the model’s predictions and help identify biases and prejudices in its decisions.
The task of text extraction from documents is commonly targeted as a computer vision problem with text postprocessing done to ensure correctness. A sub-problem in that domain is more focused on identifying key-value pairs in a scanned document in order to populate in an exist- ing database. In this project we propose a method for extracting these key-value pairs from document images usingboth layout information and textual associations. Thus, this is a multi-modal approach in which we use image embeddings from FastRCNN integrated with BERT model to combine text, layout, and image features. We create a different variant of positional embeddings and pretrain BERT using masking. This model is then fine-tuned for the downstream task of key-value pair extraction. This method has applications in information extraction and quick document analysis tasks.
Curriculum learning utilizes a novel scheduling algorithm based on standard deviation of class label distribution. Input data is represented as an array with dimensions (m samples, C channels, H height, W width), each with corresponding output labels. The standard deviation of class distribution reflects the difficulty of segmentation, with higher values indicating more challenging samples. Training begins with the most difficult examples, gradually incorporating easier ones in subsequent epochs. Results demonstrate that curriculum learning achieves convergence to full data training within 40 epochs, while also reducing training time by 30% compared to standard approaches. Presented as 2022 MICCAI poster.