Door and obstacle detection for Visually impaired


Course : Computer Vision

Overview
This project was to design a navigation system for the visually impaired. Since this has been a problem that has gone through research over the years we had to learn and apply some of the previous works done by researchers to solve this problem
Task 1
For the first task, we detected doors based on the geometry and heuristics that would help us detect doors within an indoor environment. We used a 3 D camera so that we could accurately capture the depth and distance information. With this we were able to detect whether the doors are open or closed.
Task 2
The second task was based on the objective to detect obstacles other than doors within an indoor environment. This task was based on K means segmentation ,sift points detection , sift localisation and contour grouping. With this we were able to detect obstacles and obtain distance metrics.

Read Full Detalis

Prominent Projects

Feature based classifiers for somatic mutation prediction


Course : Data Mining

Overview
The objective of this project is to predict somatic mutation, given a protein sequence using a feature based classifier. This is a neutral vs somatic mutation binary classification problem. Somatic mutations can be divided into passenger and driver mutations. Driver mutations are considered as cancer-causing mutations and an analysis of features affecting somatic vs neutral classification can be a basis of the task of distinguishing between driver and passenger mutations to predict cancer with the mutation information.

Read Full Detalis

Opiate prescription Analysis using Machine Learning


Course : Applied Machine Learning

Overview
The project was based on the impending danger of exceeding Opiate drug overdose. The project was based on the objective to predict whether a prescriber is Opiate prescriber or not based on their non opiate prescription pattern. The recent drug overdose death due to Opiate drug intake was the main push for us to work on this Project.

Read Full Detalis

Twitter Sentiment Analysis


Course : Data Visualization

Overview
This project is a twitter sentimental analysis project based on the 2016 US election data. The main objective for us was to predict the chances of the electoral candidates HillaryClinton and Donaldtrump on various US states
Description
The data extraction process for this project was critical as we had to crawl data using the Twitter API. The data extraction process involved collecting data that tweeted about the twitter mention tags @realDonaldTrump and @HillaryClinton. We were able to extract data from different parts of the world for our project. Since our main objective was to concentrate on the sentiments of the people in US we stuck to the tweets tweeted from different parts of the US. We were then able to create visualizations using cartograms and other visualization methods to intuitively understand the sentiments of the people from different states in the US.

Read Full Detalis

Citation recommendation system - Microsoft Academic Graph data analysis


Course : Big Data Analytics

Overview
This project works on the hypothesis of a citation recommendation system.
Approach 1
If two papers share a similar title, then the papers being cited would also contain a common theme / topic. It is because of this hypothesis that we calculate the similarity of titles and provide similar titled papers as inputs for collaborative filtering.
Approach 2
We find that titles as Bag of Words representation by itself cannot have enough information to retrieve similar papers. We introduce a concept called "context citations". This concept is similar to word2vec's philosophy "The word is explained by the company that it chooses", we propose " A paper is characterized by the paper that it cites" more than the title similarity. This is also a smart way of avoiding the cold start problem by giving the initial inputs for the recommender system.

Read Full Detalis

Yelp dataset Challenge


Course : Search

Overview
This project is an online challenge based on the Yelp dataset.
Task 1
For the first task, the dataset was downloaded from Yelp and the process of stemming and stop words removal was done on the reviews of the customers. From this, we generated a unigram, bigram and trigram model. A tf -idf weighting was done to calculate the weights of words. With these our sole aim is to predict the restaurant category the review falls under. The Naive Bayes and LSH algorithms were used for this task.
Task 2
For this task we gather the customer visits to each restaurants from the source files. We then use Matrix factorisation methods with solvers like Alternate Least Squares (ALS) and Stochastic Gradient Descent (SGD). The idea here is to build a recommednation system that would predict on which restaurant the user would visit next.

Read Full Detalis

Professional Projects

Three Tier Telecom web interface using C#


C# project

Overview
This project is a three tier telecom system project developed using C#. The project provided me a good head start with C#. I was able to design an effective front end and implement necessary validations and understand the working of C# by developing this project.

Inspect Source code on GITHUB