Portfolio

Hi, my name is Ashish Karel.

I am an avid manga reader, sci-fi reader, and loves to run my current PR is 2.1km/7minutes. I love to learn about how things work, whether that be studying good coding practices, engineering techniques, or Computer Vision methods.

Much of my experience revolves around building Computer Vision and deep learning applications, but I also strive to gain a deeper understanding of the world so that I can expand my skillset and build new and amazing things.

Get to Know Me


My Professional Experience

KritiKal Solutions

Computer Vision Engineer

  • Document-image OCR and retrieval system using image processing and tesseract
    • Task was to extract information from NCERT textbook images and retrieve the page information for further use case.
    • Users were mostly indian school kids or parents doing self study.
    • Worked on line extraction from books image which sometimes were blurred, tilted, and curved using contour processing
    • Main hindrance to accurate system were blurred images(motion or focus), white noise or uneven lighting while capturing images.
    • Performed quality check using variance of laplacian and deblurring after removing gaussian noise(assumed blur noise to be gaussian)
    • OCR was used on all extracted lines from the images and saved text file for retrival.
    • Evaluated tablet which was going to be used for our OCR system, such as focus, exposure, shutter time, frame quality etc and comapred performance of system on normal images and images from tablet.
    • Tech-stack: c++, Opencv, python, ocr, linux, windows
  • Speech Recognition and analysis in kids speech to augment their language learning phase
    • Created pipeline to collect data for kids below 11 years
    • Performed benchmark of existing SOA (like DeepSpeech2) on the dataset for speech to text
    • Research on speech recognition model for kids speech and the feasability of existing SOA.
    • Tech-stack: python, pytorch-kaldi, tensorflow, linux, sst engines
  • Counting the number of crates unloaded in a factory warehouse using video feed
    • Data annotation and analysis from warehouse camera feed
    • Performed background subtraction for detecting regions of motion
    • Trained yolo detection for crate detection and then counting
    • Python executable was delievered for user testing
    • Tech-stack: darknet, c++, python, linux, windows, Opencv
  • 3D-barcode detection and spoof classification from 2D images
    • Trained Yolo to detect the 3d marker in images .
    • Proposed data collection for classification using micromax smartphone .
    • Proposed different types of printing paper for stimulating spoof attack like Matte, Gloosy, Laser etc
    • Spoof classifier was trained using SVM on texture features extracted using LBP.
    • Tech-stack: darknet, c++, python, linux, windows, Opencv

Computer Vision Trainee.

  • Displacement calculation of washing machine using video feed
    • Required to predict the displacment within -+5mm of actual displacement.
    • Performed camera calibration, and perspective transformation from image to world coordinate system.
    • Proposed camera and aruco marker ideal setup after numerous experiment
    • performed experiment and error analysis on geometric algorithms in 3D world; this repeated till we get our required threshold error.
    • Developed UI for the desktop application which runs 4 camera parallely.
    • Finally application is used during manufacturing process to identify faulty washing machines.
    • Tech-stack: c++, windows, QT Designer, Opencv
  • Face Detection comparision survey
    • Compared Opencv’s (Haar and cascade based face detector) and dlib(HoG + SVM based face detector) face detection methods.
    • Analysed their performance by varying the distance.
    • Tech-stack: Opencv, Dlib, python, linux
  • 3D bone segmentation from MRI images of patients for pre-surgery analysis
    • Client wants us to handle the 3D patients bone segmentation for surgeons to perform virtual operations before operating on patients
    • Researched on clients segmentation tool and human autonomy before presenting our solution
    • Performed live segmentation in a meeting and fine tuned the results using hyper-parameters in the tool
    • Proposed to supervise more than 10 employees for the task
    • tech-stack: MRI scans, windows, segmentation tool
  • Grading Skin Health
    • Proposed to predict the health of skin after accounting wrinkles, tone, texture , marks(pimples or lesions)
    • Image processing algorithms such as LBP for tone and texture score, edge detection and fitting to polynomial function for wrinkles score and conture based detection for marks or non-uniformity score in skin.

Cell Propulsion Limited

Python Developer Intern.

  • Sensor Fusion
    • Part of Computer Vision team on Self Driving Car.
    • Setup Texas instrument’s radar sensor in my laptop and processed the binary data into visualization plot
    • Performed clustering of detection to decrease the number of false detection (using linear kalman filter and dbscan)
    • Evaluated the radar by mounting on self driving car to determine its feasibility
    • Proposed 12 radar required for covering detection from all directions
    • Tech-stack: Python, Kalman filter, linux, Texas instrument radar sensor

LegalHub Technologies Pvt. Ltd.

NLP Engineer.

  • Document Retrieval System
    • Developed Search Engine for Indian Law DataSet.
    • We have tested different types of searching algorithms on Indian Law DataSet
    • Implemented filtering on search results and deployed an overall system on localhost.
    • Methods used: word2vec, doc2vec, LDA, n-gram’s, tf-idf
    • Tech-stack: python,linux, keras, AWS
  • Document Summarizer
    • Implemented extraction based summarizer and query based summarizer .
    • Tech-stack: python, keras, linux, AWS

Silversparro Technologies Pvt. Ltd.

Deep Learning Engineer Intern.

  • Worked on Masked-RCNN to segement text lines from the reciept images to improve existing OCR model
    • Tech-stack: python, tensorflow, keras, AWS
  • Worked on speech to text for a large e-commerce company on customer service recordings
    • Tech-stack: python, tensorflow, keras, AWS
  • Worked on voot website customer data to predict the customer engagement: performed text analysis and cleaning(>40GB) and initial model creation
    • Tech-stack: python, tensorflow, keras, sst engines, AWS

LNMIIT

  • Sub-coordinator Of Data-Science club
    • I was a sub-coodinator of Data-Science of CYBROS(a technical community club at LNMIIT), introducing and guiding students at LNMIIT in Data Science and sub fields.

NVIDIA Developer Conference 2019

  • Participation as a Student
    • Participated at NVIDIA’s developer conference held at mumbai.
    • This conference was aimed to bring all developers together and discuss current technologies in Deep learning and how industries utilises them.

Talk on Mask-RCNN on Image-Segmentation.

  • Particiapted as a Presenter
    • I had delivered a talk on Masked-RCNN at Delhi’s Data Science Meetup.
    • Report can be found here.

My Research Experience


Projects

  • HackCovd 2.0
    • Created a real-time desktop integrated with surveillance camera to monitor covid regulations are followed
    • Used nanodet(anchor free detection model) for pedestrain detection
    • For face detection used resnet10+ssd and mobilenetv2 for mask classification
    • Tech-stack: python, pytorch, tensorflow, pyQt5, numpy, linux
  • Mask and Helmet classifiaction for construction workers
    • Classified if construction workers wore helmet and mask
    • multilabel classification: mobilenetv2, accuracy obtained 82%
    • Tech-stack: python, pytorch, colab, numpy
  • Institute Map
    • Guidance mobile app for people’s first visit(like parents, new joinee and admission students) to LNMIIT being a vast campus of 100 acres with huge infrastructure .
    • In this project we collected multiple images of 7 different locations of LNMIIT .
    • With this app, user just need to input its surrounding image, and it will recognises the input image, this can locate and guide user to pinned location in college map.
    • Api creation using flask, and deployed on heroku .
    • Report
    • Tech-stack: numpy, opencv, sklearn, python, linux, heroku, flask
  • Cats vs Dogs classifiaction
    • Implemented a VGG-16 net from scratch
    • Used pretrained weights as initialization in keras to solve this problem.
    • Tech-stack: numpy, python, linux, keras, AWS
  • DeepLearning Implementations
    • Handwritten digit recognition on MNIST implemented using tensorflow.
    • Implemented logistic regression using Tensorflow.
    • Implemented linear regression using Tensorflow.
    • Implemented Shallow neural network in pytorch.
    • Tech-stack: numpy, python, linux, tensorflow, pytorch, AWS
  • Shallow NN code from scratch
    • Implemented neural-network code from scratch using python and numpy.
    • Accuracy of 92% obtained on MNIST dataset.
    • Tech-stack: numpy, python, linux
  • Clustering-species
    • To cluster genetic marker of different species required complex clustering algorithms and simpler KMeans won’t be effective.
    • Hiearchical clustering and elbow method for number clusters determination .
    • Tech-stack: numpy, python, linux, sklearn, pandas
  • Content-based recommender system
    • Implemented a content based recommender system to recoomend genre, books for user for a typical bookstore website.
    • Tech-stack: numpy, python, linux, sklearn, pandas
  • Loan Prediction
    • To help Indessa Bank identify default members(unable to satisfy terms of loan).
    • Dataset containing 1 lakh members detail and with 45 features each.
    • Applied feature engineering, statistical test (like chi-square test ,t-test and Pearson correlation),visualisation techniques(box plot,scatter plot,histogram and bar chart)
    • Tested Logistic Regression,Random Forest and XGBoost.
    • I was able to secure a rank within top 4% of users.
    • Tech-stack: numpy, python, linux, sklearn, pandas
  • The Cab Service
    • Contest held on hackerearth, aimed to help company predict the trip fare amount using modern Machine Learning approaches.
    • Tech-stack: numpy, python, linux, sklearn, pandas
  • Titanic: Machine Learning from Disaster
    • Binary classification problem, to predict whether a person on titanic survived or not.
    • Obtain an accuracy of 81.34%,within top 7%.
    • Used several models like Logistic Regression,SVM and Decision tress,but after performing data analysis and feature extraction its seems RandomForest Classifier results performed well on all evaluation metrics.
    • Tech-stack: numpy, python, linux, sklearn, pandas
  • House Prices: Advanced Regression Techniques
    • This is a regression problem: based on information about houses we predict their prices.
    • Used Supervised Learning methods to predict Boston house prices.
    • With ensemble modelling and feature engineering my model was finally able to predict prices with confidence around 80%.
    • Tech-stack: numpy, python, linux, sklearn, pandas
  • HandWritten Digit Recognition
    • I used Machine learning multi class classification models like Logistic regression, SVM, KNN with help of sklearn.
    • Tech-stack: numpy, python, linux, sklearn