microsoft / AI-Chat-App-Hack

HackTogether: The Microsoft Python Chatbot Hack | Register, Hack, Win
MIT License
216 stars 42 forks source link

Project: Redefining RAG: Azure Document Intelligence + Azure CosmosDB Mongo vCore #103

Open Divakar-kumar opened 4 months ago

Divakar-kumar commented 4 months ago

Project name

CosmicTalent

Description

About

CosmicTalent, an application designed to empower HR and managers in effectively navigating employee information and efficiently filtering or identifying eligible employees based on specific task requirements.

Inspiration

Currently in Service-based companies, it is highly challenging to find and retain the top talents to meet the needs of their clients. Traditional method of choosing a talent for client requirement is not efficient, as it is tend to be lengthy process and would have mismatches between the candidate and roles. Also it is difficult to understand the expectations of the talent being hired.

However with our CosmicTalent app, now companies could make use or Vector search capabilities to better retrieve employees for a specific requirement and also know about their employees better. This will help onboarding the best-fit candidates for the client's requirement and also improve retainment of talents by understanding their unique capabilities more efficiently.

PS: Another flavor of this app could be used for Talent selection . Gone are the days of manual resume screening. Now with our app we have option to onboard best-fit talents

Architecture

Data Flow

Document Intelligence Process

Resume Upload Process

Copilot Process

Refinement of RAG flow

GetResumeEmbeddingChunks Code

Image description

Image description

Custom Summarization Logic

Better results after Refinement of RAG flow

The initial version of our application did not consider any chunking patterns and generated vector embeddings for the entire content within the Employee collection. Here were the issues encountered before refining the RAG flow:

Challenges before RAG Flow Refinement

1. Not able to retrieve information effectively

Image description

2. Token exceed limitations

Image description

Results After RAG Flow Refinement

1. Able to retrieve information effectively

Image description

2. Improvements in overall Token consumption

Image description

Language

English

Project Repository URL

https://github.com/Cloud-Jas/CosmicTalent

Deployed Endpoint URL

https://chatapp.victorioussky-69f01f3f.centralindia.azurecontainerapps.io/

Project video

https://youtu.be/XYm3c7_B0W0

Team members

Divakar-kumar

Showcase Consent

Yes

Divakar-kumar commented 4 months ago

PS : Deployed version instances will be scaled down to zero in case of no usage. So expect some delays :) Currently, Divakar Kumar (FEC0120 -Backend), Sakthivel Ganesan (FEC002-Backend) and Thamodaran Kumar (FEC0005-Frontend) are the employees registered in the system.

DeonEmyr commented 2 months ago

Can I ask what the relative benefit of using these components vs using Azure AI Search

Divakar-kumar commented 2 months ago

@DeonEmyr It depends on the level of flexibility you might require, If you already have MongoDB as your transactions store , we could use same database for vector search capabilities as well. However AI search would be a better choice if you would require information from multiple data sources and perform hybrid searches + semantic rank features.