Vignana-Jyothi / kp-gen-ai

MIT License
0 stars 0 forks source link

[Theory] portal.vision.cognitive.azure.com #20

Open head-iie-vnr opened 2 days ago

head-iie-vnr commented 2 days ago

Azure Open AI is services from Auzure platform for Generative AI

Here we can perform search I has APIs using REST, we can

Dense Captions. : For every Item detected in the image, it can generate caption Background Removal

Microsoft is doing

Ideas:

Similar ones are available in HuggingFace.

head-iie-vnr commented 2 days ago

The Azure Vision Studio portal at portal.vision.cognitive.azure.com supports a wide range of functionalities leveraging Azure's AI Vision capabilities. These functionalities include:

  1. Optical Character Recognition (OCR): Extracts printed and handwritten text from images and documents, supporting various languages and writing styles. This is useful for digitizing business documents, invoices, receipts, posters, business cards, letters, and whiteboards.

  2. Image Analysis: Provides detailed insights from images, including detecting objects, faces, adult content, and generating auto-generated text descriptions. It can also tag images, categorize them, and identify brands and celebrities.

  3. Face Recognition: Detects, recognizes, and analyzes human faces in images. This can be used for identity verification, touchless access control, and privacy features like face blurring.

  4. Video Analysis: Includes capabilities such as spatial analysis and video retrieval. Spatial analysis tracks the presence and movement of people in real-time, while video retrieval enables searching within video content using natural language queries.

  5. Custom Vision: Allows users to train custom image classification and object detection models with minimal images, enabling tailored AI solutions without extensive machine learning expertise.

  6. Digital Asset Management: Enhances the organization, storage, retrieval, and management of digital media assets by grouping and identifying images based on logos, faces, objects, colors, and generating searchable keywords and captions.

  7. Spatial Analysis: Analyzes environments in real-time to understand people’s movements and presence, useful for scenarios like crowd management and safety monitoring.

  8. Responsible AI: Provides guidelines and tools to ensure the ethical and accurate use of AI Vision capabilities.

These features collectively enable robust image and video analysis, making Azure Vision Studio a powerful tool for developers and businesses looking to integrate advanced visual recognition and analysis into their applications

head-iie-vnr commented 2 days ago

The free tier of Azure Computer Vision API (F0) offers the following quotas and rate limits:

  1. Transactions Per Month: 5,000 free transactions per month.
  2. Transactions Per Minute: Up to 20 transactions per minute.
  3. Rate Limit: 20 calls per minute, which translates to a maximum of approximately 0.33 transactions per second.

For more detailed limits and pricing information, you can check the Azure Computer Vision pricing page and the Azure AI Vision documentation

head-iie-vnr commented 2 days ago

Expertiment : Text extract from Image

https://portal.vision.cognitive.azure.com/demo/extract-text-from-images

Telugu is supported

Tried to upload below image.

WhatsApp Image 2024-05-09 at 5 58 06 PM

Normal Users Get $200 free credit toward Azure products and services, plus 12 months of popular free services.

Education Users: Students 18 and up can get $100 in free credits. Get software, templates, and the resources to build custom apps in the cloud.