Hey! I am

Vedant Jolly

I'm a

About

About Me

I am a computer engineering student who loves technology and who wants to think and create different logical and technical solutions for the problems in society using the power of technology. Bringing data-driven solutions and performing R&D for improvising existing techniques is what I look forward to work on.

  • Name: Vedant Jolly
  • Address: Mumbai, India
  • Email: vedantjolly2001@gmail.com

Education

2019-2023

B.Tech. - Computer Engineering

Sardar Patel Institute Of Technology, Mumbai

CGPA :  9.75 / 10
Coursework : Data Structures and Algorithms, Database Management Systems, Machine Learning & Artifical Intelligence, Operating Systems, Computer Networking, Software Engineering.

2018-2019

H.S.C. (Class XII) - Science

Prakash College Of Science And Commerce, Kandivali

Percentage :  84.00 %
Coursework : Mathematics, Physics, Chemistry

2016-2017

S.S.C. (Class X)

Oxford Public School, Kandivali

Percentage :  91.60 %

Experience

July 2023 - Present

Graduate Analyst

Deutsche Bank

  • Role Overview:
    • Contributed significantly to the Anti-Financial Crime and Compliance team within a banking institution.
    • Primarily focused on developing cutting-edge applications aimed at combating financial crimes and ensuring regulatory compliance.
  • Application Development for Risk Assessment:
    • Developed bespoke applications utilized internally by the bank.
    • These applications were pivotal in assessing the risk associated with each transaction, deal, or trade processed through the institution.
    • Ensured these applications aligned with regulatory standards, contributing to a more secure and compliant banking environment.
  • Project Finsights:
    • Spearheaded a project named Finsights, a pivotal tool within the bank's toolkit.
    • Finsights was designed to convert natural language text into comprehensive diagrams and charts.
    • Leveraged datasets to translate complex information into visually accessible formats, aiding in decision-making processes within the institution.
  • NLP-based Q&A Project:
    • Played a key role in a Natural Language Processing (NLP) project centered around question-and-answer functionality.
    • Leveraged PDF documents as a primary data source for this project.
    • Implemented NLP techniques to extract information and provide insightful answers based on the content within the provided PDFs.
  • Impact and Contribution:
    • My contributions were instrumental in fortifying the bank's capabilities against financial crimes.
    • Enhanced decision-making processes by converting complex data into user-friendly formats through Finsights.
    • Enabled efficient information extraction and retrieval from PDF documents, streamlining the bank's internal processes.
  • Team Collaboration and Innovation:
    • Collaborated closely with multidisciplinary teams, showcasing strong teamwork and communication skills.
    • Demonstrated innovation by integrating technological solutions to address critical challenges within the banking sector.
  • Continuous Learning and Adaptation:
    • Engaged in ongoing learning and adaptation to stay abreast of emerging technologies and compliance standards.
    • Kept abreast of industry trends to ensure the applications and projects were at the forefront of technological advancements.

Apr 2023 - Aug 2023

GSoC Contributor

Google Summer of Code

  • Enhanced SPDX Tool Functionality: Spearheaded significant enhancements to the SPDX Online Tool, including automated pull request creation and robust error handling. These improvements streamlined the submission process and ensured a more user-friendly experience.
  • Testing and Reliability: Implemented comprehensive unit tests and conducted thorough GitHub API interaction tests using mock frameworks. This effort bolstered the tool's reliability, validating its functionality and compatibility.
  • Contributions to Efficiency: Introduced features such as the generation of .txt test files, optimizing the tool's capability for license submissions. This addition not only enhanced efficiency but also served as a valuable resource for testing and validation.
  • Documentation and Collaboration: Documented the implemented features and test procedures, contributing to the project's comprehensive documentation. Actively engaged with the project team, participating in discussions and code reviews, fostering a collaborative environment for continuous improvement.

Jan 2023 - June 2023

Research Intern

Tata Institute of Fundamental Research

  • Worked in Department of Astronomy & Astrophysics under the Seismology group under the guidance of SHRAVAN HANASOGE
  • Working uopon modelling and training machines on solar observations to predict magnetic processes such as flares and field emergence. We apply a variety of tests on such machines to try to extract the patterns they have learnt.
  • Also working on Large seismic datasets that are being extracted from high-resolution observations of the Sun – typically resulting in high-dimensional inverse problems. Appropriately conditioning and parametrising the inverse problem so as to accurately recover internal structure is another topic on which I am currently woring upon.

May 2022 - July 2022

Technology Analyst Intern

Deutsche Bank

  • Developed VS Code Server over the web which is used to replicate all the VS code functionality for multiple users simultaneously
  • Developed a utility tool which is used to directly convert GCP Diagram to Teraform Code
  • Integrated the utility tool with Code Server to increase its userbase as well as efficiency for creating the Terraform code
  • Created a unified platform wherein we can create/edit the GCP diagrams as well download the Terraform code

July 2021 - August 2021

Data Science Intern

Google

  • Worked for the Singapore Google Retail Team on various Business Intelligence Projects
  • Improved the accuracy of the Sentiment Analysis used for detecting the sentiments of the reviews across various range of products
  • Developed web scrapers used to extract reviews from a range of websites such as Amazon, BestBuy, JB Hi-Fi
  • Developed the logic for Vader Sentiment Analyses which is also capable of detecting double inverse sentiments

January 2022 - June 2022

Software Developer Intern

Credence Analytics

  • Worked on ERPNext software to understand various functionalities and components of ERP
  • Made use of Frappe to build various functionalities ranging across different modules in ERP
  • Built the pilot project for tracking progress of interns based on their performance for the HR team

July 2021 - August 2021

Research Intern

S.P. Jain Institute of Management and Research

  • Studied the current Health Infrastructure of India, across various levels such as rural, urban, district, state and union territory
  • Prepared datasets for the most prominent features across different fields of Health Infrastructure
  • Developed Infographics based on the datasets created using Tableau, which helped to connect the sea of data we had

June 2021 - August 2021

Python Developer Intern

Credence Analytics

  • Incorporated Python Code Execution by building a Flask based RESTful endpoint to maintain a consistent UI for client website
  • Developed the option of Filtering Data from datasets by again building a Flask based RESTful endpoint
  • Built a native datastore to provide search and query capabilities as a part of the existing product.

September 2020 - May 2021

Research Intern

Sardar Patel Institute of Technology

  • Implemented K-means Clustering to categorize users based on their typing speed and the use of non-conventional keys.
  • Used Data Analysis on the datasets provided to choose the best algorithm which fits our requirements.
  • Developed the SVM and Random Forests Algorithm which helped in increasing the accuracy of our system from 73% to 85%.

Skills

Python

95%

Java

85%

Machine Learning

90%

Deep Learning

90%

Django

90%

Flask

90%

NLP

85%

Solidity & WEB3

80%

Tensorflow 2

85%

Terraform

80%

HTML5

85%

CSS

80%

JavaScript

85%

MySQL & MongoDB

90%

My Achievements

GSoC Program Completion

Google Summer of Code

Aug 2023  

Successfully completed Google Summer of Code (GSoC), collaborating with SPDX to develop innovative solutions, gaining hands-on experience in SPDX License Submission. Delivered impactful contributions, leveraging cutting-edge technologies and mentorship, culminating in a successful project completion.

Winner

National Entreprenurship Challenge

Dec 2022  

Created with the idea of developing entrepreneurship from the grassroots, the aim of the competition is to plan and promote entrepreneurship among the nation's younger generation and to ensure that youngsters don't feel that starting up a company is rocket science. National Entrepreneurship Challenge plans to develop entrepreneurship developing bodies in each student campus in India. It is the first of its kind pan India challenge

Winner

Competitiveness Mindset Institute

May 2020  

FLY against COVID is an idea pitching competition with a vision to provide a platform for the young developers to portray their ideas for solving real world problems and inspire others.The aim of FLY against COVID was to come up with an idea which helps to reduce the impact of COVID-19 on different sectors viz. health, education, employment and environment.

Runner Up - S.P.I.T Hackathon 2021

Sardar Patel Institute of Technology, Mumbai

February 2021  

S.P.I.T Hackathon is a 24 hrs hackathon to find out solutions that tackle real world problems caused due to corona pandemic. We were in the top 10 teams amongst 250+ participants. Developed FixYourSelf, a complete automated web-app which is used to detect as well as correct the posture of a user sitting in front of their work station.
Developed a web-app which is used to detect and correct the posture of the user.

Second Prize - CSI Code Housie

Sardar Patel Institute of Technology, Mumbai

September 2020  

CSI Code Housie is an annual coding competing organized by CSI S.P.I.T. I came second among 125+ participants.

My Projects

Simplify

Aug 2022 - Mar 2023      Team Size: 03

Simplify stands for a smart intelligent system that can code like a human being for a data science application. It enables data scientists to perform all the tedious and time-consuming tasks such as EDA (exploratory data analysis), data cleaning, data pre-processing, data visualization, modeling, and evaluation in the data-science life cycle, by only conveying the logic of the task in natural language (English query) and the system will automatically give out all the relevant python code snippets, or in other words the user just needs to type what they want in the form of a natural language query (English), and our system will automatically give out all the relevant code snippets in python for it.

Technology Stack: React, Flask, Tensorflow, NLP

Sushruta

Sept 2021 - Nov 2021      Team Size: 03

Sushruta is a ML based responsive chatbot that is used to detect the disease for a user based on the symptoms provided. The user can chat with the chatbot to get information about disease, diagnose a disease and check symptoms. The user can also request to search for nearby hospitals. Made an Android app which is connect to a Flask based RESTFUL webserver. Used TensorFlow for formulating our ML algorithm which is then combined with our dataset of diseases, which we are scraping from National Health Registry

Technology Stack: Java, Flask, Tensorflow, XML

FixYourSelf

Dec 2020 - Feb 2021      Team Size: 03

FixYourSelf is a ML based responsive website that is used to implement posture correction for a user sitting in front of their work station. Used Flask as our backend for linking the web pages and SQLite as our database for storing the information for user’s activity. Used TensorFlow.js for formulating our ML algorithm coupled with JeelizAR for detecting and correcting the posture of user.

Technology Stack: JavaScript, Flask, Tensorflow.js, SQLite, HTML, CSS

Plagarism Checker

August – October 2020      Team Size: 02

Developed a Plagiarism Detection Web App which is used to detect content that is taken over the web without the help of any Google APIs. Used Cosine Similarity Algorithm through which, I was able to achieve an accuracy of about 88%. Used Flask as our backend for linking the web pages and SQLite as our database for storing the user’s personal information

Technology Stack: Web Scraping, Flask, SQLite, HTML, CSS ,JavaScript

Priceopedia

September - November 2020      Team Size: 03

Priceopedia is a Web Application which is used to track price of different products. Used JavaScript libraries like Chart.js to show interactive graphs, which show price range between different time periods like 1 week, 3 weeks or 1 month. Used Django as our backend for user authentication and linking web pages and HTML, CSS and JS for designing web pages and PostgreSQL as our database for storing product related information

Technology Stack: Django, PostgrSQL, HTML, CSS, JavaScript, Chart.js

My Publications

Smart Intelligent System that can Code Like a Human Being

4th International Conference for Emerging Technology (INCET) - 2023

According to recent studies, a large number of data scientists spend most of their time on tasks like data cleaning and organizing data. They need to memorize big complex syntaxes for all the major tasks in the data science life cycle. Often these tasks are redundant. Therefore, we propose to build an intelligent system that enables data scientists to perform all the tedious and time-consuming tasks such as EDA, data cleansing, data preprocessing, data visualization, modeling, and data science lifecycle evaluation. Just state the logic of your query in natural language the system will automatically output all relevant Python code snippets. Existing applications involving the text-to-code generation and code search are limited and a lot of them do not work in non-ideal conditions. The reason behind it is the data set on which the existing models have been built. These datasets do not consider real-world factors such as slang, acronyms, and paraphrases. Therefore, a new dataset was created consisting of real-world user queries, representing the scenarios a user is most likely to face daily. We plan to build a logic-oriented system that only needs to convey the logic correctly in text in natural language. It saves a lot of time, allowing data scientists to spend most of their time building logic instead of focusing on code.

Bringing Monochrome to Life: A GAN-based Approach to Colorizing Black and White Images

8th International Conference for Convergence in Technology (I2CT) - 2023

The automatic image colorization technique has garnered a lot of attention over the past ten years for a variety of applications, including the restoration of old or damaged photos. Due to the many degrees of freedom in the assignment of color information, this problem is very poorly presented. Recent advances in automatic colorization either use photographs with a recurring theme or need highly processed data, like semantic maps, as input. CNN is highly adept at classifying objects and can quickly identify skies, trees, and other vegetation, faces, and people. Even if flags and famous monuments are represented in the data sets, colorization fails to effectively depict them when applied to these items. In our method, we use a conditional Conditional Generative Adversarial Network (Conditional GAN) to try and fully generalize the colorization process. The network is trained using publicly accessible datasets like CIFAR-10.

Eye Disease Detection using MobiNet

2nd International Conference on Paradigm Shifts in Communications Embedded Systems, Machine Learning and Signal Processing (PCEMS) - 2023

The number of persons worldwide who are blind or partially impaired is close to 285 million. According to the most recent World Health Organization data, the doctor-to-patient ratio in India is approximately 0.74:1000. This enormous disparity results in treatment delays in the majority of instances. The sad thing is that disorders like diabetic retinopathy (DR) and glaucoma spread more rapidly and can result in total blindness as a result of the delayed treatment obtained. The sad part of the story is that these diseases can be cured in 75% of cases. The suggested machine learning model focuses on these elements and aids in the early diagnosis of eye disease based on the fundus scope image of the eye, which can aid in the patient’s survival. Based on the provided dataset, we used the MobiNet model to identify several eye illnesses. The experimental research verified that, when tested in various lighting circumstances, the suggested model produced improved accuracy in detecting eye illnesses. By enhancing the disease identification process, the algorithm has the potential to lessen the strain on the already overburdened healthcare system.

CNN based Deep Learning model for Deepfake Detection

2nd Asian Conference on Innovation in Technology (ASIANCON) - 2022

In the recent period there has been massive progress in synthetic image generation and manipulation which significantly raises concerns for its ill applications towards society. This would result in spreading false information, leading to loss of trust in digital content. This paper introduces an automated and effective approach to get facial expressions in videos, and especially focused on the latest method used to produce hyper realistic fake videos: Deepfake. Using faceforenc++ dataset for training our model, we achieved more that 99% successful detection rate in Deepfake, Face2Face, faceSwap and neural texture. Regular image forensics techniques are usually not very useful, because of the strong deterioration of data due to the compression. Thus, this paper follows a layered approach with first detecting the subject with the help of existing facial recognition networks followed by extracting facial features using CNN, then passing through the LSTM layer, where we make use of our temporal sequence for face manipulation between frames. Finally use of the Recycle-GAN which internally makes use of generative adversarial networks to merge spatial and temporal data.

Posture Correction and Detection using 3-D Image Classification

International Conference for Advancement in Technology - ICONAT 2022

Posture detection is quite challenging as it involves a lot of intricacies for correctly identifying the key points involved. In this paper, we analyzed existing Posture detection models and figured out the shortcomings associated with them. Half of them involve hardware, making them immovable and inconvenient for the user. The other half engages with a recognition system based on sensors that involve high costs and complex implementation processes. Scrutinizing them one after another, we developed a comprehensive hybrid model that converts a 2-dimensional scene into a 3-dimensional one, making it easier & quicker for someone sitting in front of a screen to correct their posture. This model can capture the temporal evolution of information offering insights into the user's posture at different times. It helps in achieving better results than the existing models.

Stock Market Prediction using Bi-Directional LSTM

International Conference on Communication, Information and Computer Technology - ICCICT 2021

Stock market prediction is quite challenging as the market is volatile and its direction is stochastic. The stock market gets driven by several factors like investor sentiment, economic strength, market rumors, inflation. All these aspects together make the stock market quite turbulent and hence very difficult to predict with accuracy. In this paper, we analyzed traditional Machine Learning prediction models and figured out the drawbacks associated with them. Hence we scrutinized a range of stock prediction models and finally singled out the Bi-directional Long Short-Term Memory (Bi-LSTM) neural network. It intends to find out the title role of time series by analyzing historical data of different stocks and predict stock price trends. They form a unified framework for depth and time calculation learning faster than the one-directional approach. It can capture the temporal evolution of information which allows this model to attain the best performance.

Technology Stack: Tensorflow, Sklearn, Numpy, Pandas