Read the following
A thorough reading of the following topics is required:
Hive and Dataproc, and how they work on Google Cloud
How the workers can download dependencies from the internet
How Cloud Datastore and Spanner differ
When to use Spanner vs. Bigtable
Where would you store 2 PB: Datastore, Spanner, or Bigtable?
Where would you store 1 TB?
KafkaIO
Storage Transfer Service vs. Transfer Appliance. Can you use a private web address to transfer 2 PB of data over six months with Storage Transfer Service?
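The 2 PB over six months figure can be turned into a required line rate, which helps when weighing an online transfer against an appliance. The sketch below is only arithmetic; it assumes decimal units (1 PB = 10^15 bytes), 30-day months, and a link that is fully used around the clock, none of which the question states.

```python
# Sustained throughput needed to move a dataset within a deadline.
# Assumptions (not from the question): 1 PB = 10**15 bytes, 30-day months,
# and the link is saturated 24/7 with no protocol overhead.

def required_gbps(total_bytes: float, days: float) -> float:
    """Return the average line rate in Gbit/s needed to finish in `days`."""
    seconds = days * 24 * 60 * 60
    bits = total_bytes * 8
    return bits / seconds / 1e9

two_pb = 2 * 10**15
six_months_days = 6 * 30

rate = required_gbps(two_pb, six_months_days)
print(f"{rate:.2f} Gbit/s sustained")  # prints "1.03 Gbit/s sustained"
```

In practice the available bandwidth is shared and rarely saturated, so the real requirement would be higher than this lower bound.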
Migrating from on-premises to GCP: what is a fault-tolerant method?
Do you copy the files to GCS, run Dataproc, and change references from hdfs:// to gs://?
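The hdfs:// to gs:// change mentioned above is mostly a matter of rewriting path prefixes in job arguments and configuration. A minimal sketch, assuming the directory layout is copied unchanged into a single bucket; the helper name `hdfs_to_gs`, the bucket name, and the sample paths are all hypothetical.

```python
from urllib.parse import urlparse

def hdfs_to_gs(uri: str, bucket: str) -> str:
    """Rewrite an hdfs:// URI to a gs:// URI under `bucket`, keeping the path.

    Anything that is not an hdfs:// URI is returned unchanged.
    """
    parsed = urlparse(uri)
    if parsed.scheme != "hdfs":
        return uri
    # Drop the namenode host:port (parsed.netloc); keep only the file path.
    return f"gs://{bucket}{parsed.path}"

# Hypothetical job arguments, mixing HDFS paths, GCS paths, and flags.
job_args = [
    "hdfs://namenode:8020/data/events/2020-01-01.avro",
    "gs://already-migrated/data/ref.csv",
    "--verbose",
]
migrated = [hdfs_to_gs(arg, "example-bucket") for arg in job_args]
print(migrated)
```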
Partitioning by export timestamp to decrease the query bill? (timestamp and ID are used as input)
Data analysts mostly query the last 30 days. You want to decrease cost immediately. Data arrives once each day.
Copy the last 30 files to GCS and run the analysis over them?
Export 30 days of data to Data Studio?
Create a table/partition with the last 30-60 days of data?
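The cost argument behind the partitioning options can be sketched with a rough estimate: when a query is restricted to the partitions it needs, the bytes scanned shrink roughly in proportion to the number of days read. The numbers below are illustrative assumptions only (one equally sized load per day, 365 days retained); the function name is hypothetical.

```python
def scanned_fraction(days_queried: int, days_retained: int) -> float:
    """Fraction of a daily-partitioned table read by a query that is
    pruned to the most recent `days_queried` partitions.

    Assumes every daily partition holds the same amount of data.
    """
    if days_retained <= 0:
        raise ValueError("days_retained must be positive")
    return min(days_queried, days_retained) / days_retained

# Assumed retention of one year; the question only states the 30-day window.
frac = scanned_fraction(days_queried=30, days_retained=365)
print(f"{frac:.1%} of the table scanned")  # prints "8.2% of the table scanned"
```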
Dataflow template vs. a DAG on Cloud Composer for running Spark jobs, some in sequence and some concurrently?
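The "some in sequence, some concurrent" requirement is what a DAG expresses: each job lists the jobs it depends on, and anything whose dependencies are done can run in parallel. The sketch below uses only Python's standard `graphlib` to show the idea; it is not Composer or Airflow code, and the job names are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical Spark jobs; each key maps to the set of jobs it depends on.
dependencies = {
    "ingest": set(),
    "clean_a": {"ingest"},
    "clean_b": {"ingest"},
    "report": {"clean_a", "clean_b"},
}

def execution_waves(deps):
    """Group jobs into waves: jobs in one wave can run concurrently,
    and a wave starts only after the previous wave has finished."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # all jobs runnable right now
        waves.append(ready)
        sorter.done(*ready)                 # mark them finished
    return waves

waves = execution_waves(dependencies)
print(waves)  # [['ingest'], ['clean_a', 'clean_b'], ['report']]
```

Here `clean_a` and `clean_b` run concurrently after `ingest`, and `report` waits for both.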
The AUC (area under the curve) is 1.05. How would you improve it?
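As a reminder of what the metric measures, ROC AUC can be computed as the share of (positive, negative) pairs in which the positive example gets the higher score. Because it is a ratio of pairs, it always lies between 0 and 1, so the value recorded in the question is presumably a transcription slip. A minimal sketch with made-up labels and scores:

```python
def roc_auc(labels, scores):
    """ROC AUC as the probability that a random positive is scored above
    a random negative; ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

# Illustrative data only.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
auc = roc_auc(labels, scores)
print(auc)  # 0.75
```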
fluentd in_tail plugin vs. MySQL plugin for MariaDB with the Stackdriver agent?
Does the Cloud Vision API offer damage detection without training?
Cloud ML vs. Dataproc Spark ML when starting from existing Spark ML models. Can we pull data from BigQuery directly?
Increase availability: for SQL-like querying, storage over several years, and streaming
Put data in a multi-region GCS bucket. Pull data with BigQuery, Dataflow, Cloud Storage
Pub/Sub
Kubeflow