Are Google Cloud Storage files indexed in search? - google-cloud-storage

If I upload some files into a Google Cloud Storage bucket and made them public does Google index them in search results?

Google's search index is independent of Google Cloud Storage (GCS). Making a file public in GCS does not automatically index it in Google's search results.
So, public files in GCS will not automatically be indexed by Google.

Related

Cloud Api Product Search asked for storing the images in the google cloud storage, but can i store them in the firebase storage as alternative?

I'm using the Cloud Api product search, they ask me to store the images and the csv file in the google cloud storage but i'm already using firebase in my project and my images stored there so can i choose firebase than the google cloud ?
Yes, you can use it as an alternative. As Renaud Tarnec commented, Firebase Storage underneath is Google Cloud Storage. This means that any bucket in your Firebase console is actually a regular Google Cloud Storage bucket.
You can confirm this by creating a Storage Bucket in the Firebase console and uploading objects to it (images for your dataset and the CSV file). If you access the Cloud Storage page in the GCP console, the same bucket and objects will be there.
As for using the Vision Product Search API with Firebase Storage, I followed this quickstart guide, but edited the CSV file to point to images that were uploaded to the default Firebase Storage bucket (PROJECT-ID.appspot.com).
After letting the dataset images index, I was then able to send an image to be compared to the dataset as expected.
Keep in mind the service account you use to create a Vision API dataset must be able to read Cloud Storage Objects to access the images in your Firebase Storage bucket. Let me know if this was useful.

how to search text in json file that Google vision api created from pdf

Is there any way to search text in json files that Google vision api created from pdf.
searching of text should be happen over Google cloud storage only
Google Cloud Storage is an Object based storage solution that does not provide processing features. In order to perform any process job over the Cloud Storage data you would need a computing/processing solution, and I’d opt for a serverless option such as Cloud Functions.
I’ve found at the Cloud Functions Docs a sample application that integrates several APIs with Cloud Functions and Cloud Storage, I think you can use it as a guideline to develop your own setup.
Once you have the mentioned setup you could apply a regex implementation to search for the desired data, how to implement it will depend on the runtime, libraries and technologies that you choose to use.

Generate single download url for multiple files in Google Cloud Storage

I have my files residing in Google Cloud Storage bucket and I am using the java API to upload files there. In my application, for single file downloads, I am generating signed URLs but I want to provide a functionality where users can select multiple files and download them as a zip. Is there a way to achieve this with the current API?
This is not possible with the current API. You could do it yourself with a fairly simple AppEngine app though.

Is Google Cloud Storage indexed in search?

If a file is placed into Google Cloud storage and made public, but a URL to the file does not exist on another webpage, does Google index it in its search results anyway? Anyone know?
Google's search index is independent of its cloud storage. Making a file public in cloud storage does not automatically index it in Google's search results.
However, asking this question leads me to believe that you're probably wondering if it's okay to make a file public and be reasonably sure that the file is not accessible to anyone. If the files are of any sensitive nature to you or your users, this is not the right solution.
If you're using GCS on a website and want to serve files securely, you might want to try the Signed URLs option. If it's just about being indexed, you could add a robots.txt file to the root of your bucket that excludes the file from being indexed.
If you have a public file in Google's cloud storage, the URL to that file would have to be indexed, otherwise the servers would have no way of finding it.
The same thing happens with Google Drive. A public URL is still indexed, and is still available within the server's lookup tables, even if a hard link doesn't exist anywhere on the web.

List of public Google Cloud Storage buckets

Is there a way to get a list of public buckets on Google Cloud Storage?
I can't find anything on this subject in the docs: http://code.google.com/apis/storage/docs/gsmanager.html
Update: I recently saw a public google bucket containing patent documents. I was wondering if there is a publicly accessible list of public buckets.
You can list your own buckets but there is currently no way to get a list all (public or private) buckets. This is probably good for security. If you wanted to know if a particular bucket is available, you can always try creating it and seeing if it fails. Can you provide more details on your use case?