08 November 2020

#Apache Hive

Hive
What kind of data warehouse application is suitable for Hive?
What are the types of tables in Hive?
What types of databases does Hive support?
What is the ObjectInspector functionality in Hive?
What are the limitations of Hive?
What are the different modes in Hive?
What is Hive Bucketing?
What is the difference between partition and bucketing?
What does the Hive query processor do?
What is the difference between local and remote metastore?
What is ObjectInspector functionality in Hive?
What is Hive MetaStore?
What are the three different modes in which Hive can be run?
What are the different components of a Hive architecture?
What is a Hive variable? What do we use it for?
What is SerDe in Apache Hive?
What is RegexSerDe?
What is the significance of the "IF EXISTS" clause when dropping a table?
What is the relationship between MapReduce and Hive? Or, how are MapReduce jobs submitted to the cluster?
What is ObjectInspector functionality?
What are the default record and field delimiters used for Hive text files?
What is the difference between static and dynamic partitioning of a table?
What is the default maximum number of dynamic partitions that can be created by a mapper/reducer? How can you change it?
What options are available for attaching applications to the Hive server?
What are the uses of Hive Explode?
What is the difference between Internal and External Table?
What is a metastore in Hive?
What is the functionality of Query Processor in Apache Hive?
What is HCatalog used for?
What is the importance of Thrift server & client, JDBC and ODBC driver in Hive?
Explain the SMB (sort-merge-bucket) join in Hive.
Explain SORT BY, ORDER BY, DISTRIBUTE BY and CLUSTER BY in Hive.
Explain the functionality of ObjectInspector.
Explain the different types of partitioning in Hive.
Explain the different types of join in Hive.
Why does MapReduce not run when you execute SELECT * FROM table in Hive?
Why does Hive not store metadata information in HDFS?
Why do we perform partitioning in Hive?
Why do we need buckets?
Which classes are used in Hive to read and write HDFS files?
How is Hive different from an RDBMS?
How is data transferred from HDFS to Hive?
How does bucketing help in the faster execution of queries?
How can you prevent a large job from running for a long time?
How can you connect an application, if you run Hive as a server?
How can you stop a partition from being queried?
How does Hive deserialize and serialize the data?
How can Hive avoid MapReduce?
How does partitioning help in the faster execution of queries?
How is ORC file format optimised for data storage and analysis?
How will you optimize Hive performance?
When do we use explode in Hive?
When should we use SORT BY instead of ORDER BY?
When should we use MapReduce mode?
Where does the data of a Hive table get stored?
In Hive, how can you enable buckets?
In case of embedded Hive, can the same metastore be used by multiple users?
Can we LOAD data into a view?
Can Hive process any type of data formats?
Can we change the data type of a column in a Hive table?
Can a partition be archived? What are the advantages and disadvantages?
Can you list a few commonly used Hive services?
Can we give a view the same name as a Hive table?
Can we run UNIX shell commands from Hive? Can Hive queries be executed from script files? If yes, how?
Does the archiving of Hive tables save space in HDFS?
Does Hive support record-level insert, delete or update?
Is Hive suitable to be used for OLTP systems? Why?
Is it possible to add 100 nodes when we already have 100 nodes in Hive? If yes, how?
Suppose that I want to monitor all the open and aborted transactions in the system along with the transaction id and the transaction state. Can this be achieved using Apache Hive?
Whenever we run a Hive query, a new metastore_db is created. Why?
While loading data into a Hive table using the LOAD DATA clause, how do you specify that it is an HDFS file and not a local file?
