Hello. I'm new to this community. 🙂
I'm in the ideation phase of a B2B product, but I have questions about which Google Cloud tools I would need to use to build my app.
My app: my intention is to build an application that can scan product labels and identify whether they follow the guidelines of regulatory bodies (e.g. FDA). This involves identifying texts and their sizes, identifying stamps and logos. The intention is that I can train my evaluation system with the regulatory body's rules, so that it can read the label and return insights to the user.
Well, that's basically the intention. I was researching and saw that Google's OCR is a start, but they have a few different solutions, such as Document AI and Cloud Vision, but I wasn't sure which of these would be right for my use case.
I would be very grateful if any community members could share their experience and guide me to the correct tool.
I hope I can do the same for the community in the near future. 🙂
Hi @diegodamaceno,
Welcome to the Community! Excited to hear that you're interested in learning more about the Google Cloud ecosystem, and looking forward to seeing you get engaged across the forums. GCC is a community of communities. You'll notice that we have dedicated forum spaces for all kinds of topics, including Data Analytics, AI/ML, no-code app development and Google Cloud Computing.
Take a look around and check out some of the conversations in progress and add your voice. As a startup, you're in the right place! There are also some great programs to look into over at startup.google.com when you get a chance. I am not an expert, but I can definitely explore some internal resources with you and help point you in the direction of the best Google Cloud tools to suit your project needs!
Here's a breakdown of the options and why they could be relevant:
Core Services
Cloud Vision API: This is a great starting point. It provides powerful image analysis features, including:
Document AI: If your product labels contain structured or semi-structured information (like tables or forms), Document AI is a more specialized tool. It's designed to:
How to Decide:
To choose the most suitable one, consider these questions:
Complementary Services
Workflow Suggestion
Note: Building a robust application to accurately interpret regulatory guidelines will likely involve iterative model training and fine-tuning.
Let me know if you'd like to dive deeper into any of these aspects or discuss some of the training strategies for your evaluation system!
Also, if you haven't already, take a look at the Google Cloud Innovators Program. Some of the benefits include access to exclusive learning resources and other members-only opportunities to help you grow as a Cloud professional. Let me know if you need help navigating that!
Be sure to check out our Learning & Certification Hub, where members share best practices on preparing for certifications, stay up to date on what’s next with Google Cloud, and network with others who have similar goals.
Keep us posted on your progress!
For your product idea, Google Cloud Vision API is a great starting point. It offers OCR for text extraction, plus logo and label detection that can help identify logos and stamps on product labels. It's efficient and easy to integrate. As your project evolves, you can explore Document AI for more advanced document processing features or develop custom ML models for specific regulatory requirements.
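To make the "text and its size" part of the question a bit more concrete, here is a rough sketch of how OCR output could feed a simple rule check. It assumes the input looks like the `textAnnotations` list from a Vision text detection response (plain dicts with `description` and `boundingPoly.vertices`). The 1.6 mm minimum, the 300 DPI scan resolution, the sample data, and all helper names are made up for illustration and are not taken from any regulation or from the Vision client library. Bounding-box height is also only an approximation of the printed letter height.

```python
from dataclasses import dataclass

MM_PER_INCH = 25.4


@dataclass
class TextBox:
    text: str
    height_px: float


def boxes_from_annotations(annotations):
    """Turn Vision-style text annotations (plain dicts) into TextBox items."""
    boxes = []
    # Skip the first entry: in text detection output it holds the full text.
    for ann in annotations[1:]:
        ys = [v.get("y", 0) for v in ann["boundingPoly"]["vertices"]]
        boxes.append(TextBox(ann["description"], max(ys) - min(ys)))
    return boxes


def height_mm(box, dpi):
    """Convert a pixel height to millimetres for a scan at the given DPI."""
    return box.height_px / dpi * MM_PER_INCH


def check_min_height(boxes, required_terms, min_mm, dpi):
    """Return (text, height in mm, passes) for each term the rule covers."""
    findings = []
    for box in boxes:
        if box.text.lower() in required_terms:
            size = height_mm(box, dpi)
            findings.append((box.text, round(size, 2), size >= min_mm))
    return findings


# Hypothetical OCR result for a label scanned at 300 DPI.
sample = [
    {"description": "Nutrition Calories", "boundingPoly": {"vertices": []}},
    {"description": "Nutrition", "boundingPoly": {"vertices": [
        {"x": 10, "y": 10}, {"x": 200, "y": 10},
        {"x": 200, "y": 70}, {"x": 10, "y": 70}]}},
    {"description": "Calories", "boundingPoly": {"vertices": [
        {"x": 10, "y": 100}, {"x": 90, "y": 100},
        {"x": 90, "y": 112}, {"x": 10, "y": 112}]}},
]

findings = check_min_height(
    boxes_from_annotations(sample),
    required_terms={"nutrition", "calories"},
    min_mm=1.6,  # hypothetical threshold, not a real regulatory value
    dpi=300,
)
for text, size, ok in findings:
    print(f"{text}: {size} mm -> {'OK' if ok else 'too small'}")
```

In a real app the rules would come from the regulatory body's guidelines, and the scan resolution would need to be known or calibrated before pixel heights mean anything in millimetres.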