Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Hortonworks Exam HDPCD Topic 3 Question 60 Discussion

Actual exam question for Hortonworks's Hortonworks Data Platform Certified Developer exam
Question #: 60
Topic #: 3
[All Hortonworks Data Platform Certified Developer Questions]

You want to perform analysis on a large collection of images. You want to store this data in HDFS and process it with MapReduce but you also want to give your data analysts and data scientists the ability to process the data directly from HDFS with an interpreted high-level programming language like Python. Which format should you use to store this data in HDFS?

Show Suggested Answer Hide Answer
Suggested Answer: B

Contribute your Thoughts:

Lyda
25 days ago
If I see anyone suggest HTML or XML, I'm walking out of this exam. This is a no-brainer - Avro is the only choice that makes sense here. Unless, of course, you're into data compression that's about as effective as using a sponge to bail out a sinking ship.
upvoted 0 times
...
Leonor
30 days ago
Hey, don't knock CSV! It may be simple, but sometimes simple is best. Although, I guess if you're dealing with a ton of images, Avro might be the way to go. Gotta keep that file size down, am I right?
upvoted 0 times
Elise
4 days ago
A) SequenceFiles
upvoted 0 times
...
...
Ernest
1 months ago
I think CSV would be a good option for storing the data in HDFS as it is widely supported and easy to work with.
upvoted 0 times
...
Layla
1 months ago
I prefer using JSON because it is lightweight and easy to read.
upvoted 0 times
...
Helaine
1 months ago
Hold up, are we really considering HTML or XML for this? What is this, the 90s? Let's keep it modern and efficient, folks. Avro or bust!
upvoted 0 times
Remedios
22 hours ago
Agreed, Avro is definitely the way to go for storing data in HDFS.
upvoted 0 times
...
Viola
3 days ago
Avro is definitely the most suitable format for this task. It's the best option for our needs.
upvoted 0 times
...
Alayna
4 days ago
I think Avro is a great choice for storing data in HDFS. It's efficient and modern.
upvoted 0 times
...
Vallie
18 days ago
Agreed, HTML and XML are definitely outdated for this purpose. Avro is the way to go.
upvoted 0 times
...
...
Willodean
1 months ago
I agree with Glenna, Avro is a good choice because it supports schema evolution.
upvoted 0 times
...
Kassandra
2 months ago
I'm partial to JSON myself. It's human-readable, integrates well with Python, and is a pretty standard format for web-based data. But I can see the benefits of Avro too.
upvoted 0 times
Tyisha
27 days ago
User 2
upvoted 0 times
...
Dudley
1 months ago
User 1
upvoted 0 times
...
...
Wynell
2 months ago
Hmm, this seems like a no-brainer to me. Avro is the way to go for processing images with MapReduce and Python. It's efficient, compact, and supports complex data structures.
upvoted 0 times
Domingo
1 months ago
I agree, Avro is the most efficient format for this type of data.
upvoted 0 times
...
Paz
1 months ago
Avro is definitely the best choice for storing image data in HDFS.
upvoted 0 times
...
...
Glenna
2 months ago
I think we should use Avro to store the data in HDFS.
upvoted 0 times
...

Save Cancel