Hortonworks Exam HDPCD Topic 3 Question 57 Discussion

Actual exam question from the Hortonworks Data Platform Certified Developer (HDPCD) exam
Question #: 57
Topic #: 3

You want to perform analysis on a large collection of images. You want to store this data in HDFS and process it with MapReduce, but you also want to give your data analysts and data scientists the ability to process the data directly from HDFS with an interpreted high-level programming language like Python. Which format should you use to store this data in HDFS?

Suggested Answer: B

Contribute your Thoughts:

Cornell
3 months ago
XML? Are you kidding me? That's like using a sledgehammer to crack a nut. Keep it simple and go with JSON or Avro. Less markup, more data power!
upvoted 0 times
...
Verlene
3 months ago
HTML? What is this, the 90s? You want a format that's optimized for data, not web pages. Definitely not the right choice for this use case.
upvoted 0 times
Deeanna
1 month ago
F) CSV
upvoted 0 times
...
Sonia
1 month ago
E) XML
upvoted 0 times
...
Von
1 month ago
C) JSON
upvoted 0 times
...
Eun
1 month ago
B) Avro
upvoted 0 times
...
Arlene
2 months ago
A) SequenceFiles
upvoted 0 times
...
Rebbecca
2 months ago
F) CSV
upvoted 0 times
...
Rikki
2 months ago
Avro or JSON would be a better choice for this use case. They are more suitable for data processing.
upvoted 0 times
...
Miss
2 months ago
E) XML
upvoted 0 times
...
Mignon
2 months ago
I agree. We should consider using a format like Avro or JSON for storing data in HDFS.
upvoted 0 times
...
Vallie
2 months ago
Definitely not HTML. It's not optimized for data processing.
upvoted 0 times
...
Regenia
2 months ago
C) JSON
upvoted 0 times
...
Cathrine
2 months ago
B) Avro
upvoted 0 times
...
Willard
2 months ago
A) SequenceFiles
upvoted 0 times
...
...
Vilma
3 months ago
CSV? Really? That's so old-school. You're dealing with images, not spreadsheets. Go with something more modern like Avro or JSON.
upvoted 0 times
...
Jamal
3 months ago
Avro, for sure. It's like the Swiss Army knife of data formats - it can do it all! Plus, the name just sounds cool. Who doesn't love a good 'Avro'?
upvoted 0 times
...
Pa
3 months ago
Avro is the way to go! It's optimized for large data sets and supports complex data structures. Plus, it integrates seamlessly with MapReduce and Python.
upvoted 0 times
Roslyn
2 months ago
Using Avro will make it easier for data analysts and data scientists to process the data directly from HDFS.
upvoted 0 times
...
Glen
2 months ago
It's great that Avro integrates seamlessly with MapReduce and Python.
upvoted 0 times
...
Cyril
2 months ago
Avro is the way to go! It's optimized for large data sets and supports complex data structures. Plus, it integrates seamlessly with MapReduce and Python.
upvoted 0 times
...
Jeannetta
2 months ago
F) CSV
upvoted 0 times
...
Lucina
2 months ago
E) XML
upvoted 0 times
...
Estrella
2 months ago
D) HTML
upvoted 0 times
...
Latrice
2 months ago
C) JSON
upvoted 0 times
...
Lavonda
2 months ago
B) Avro
upvoted 0 times
...
Ashton
3 months ago
A) SequenceFiles
upvoted 0 times
...
Aleisha
3 months ago
I agree, Avro is optimized for large data sets and supports complex data structures.
upvoted 0 times
...
Lavonda
3 months ago
Avro is definitely the best choice for storing data in HDFS.
upvoted 0 times
...
...
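
Several comments above say Avro supports complex data structures and can be used from Python. As a rough illustration only, here is a minimal sketch of what an Avro record schema for image data might look like, built with Python's standard library. The record name, namespace, and field names (`filename`, `content_type`, `data`) are assumptions made up for this example; nothing in the question specifies them.

```python
import json

# Hypothetical schema for illustration: one Avro record per image.
# The record name, namespace, and field names are assumptions,
# not something stated in the exam question or the thread.
IMAGE_SCHEMA = {
    "type": "record",
    "name": "Image",
    "namespace": "example.images",
    "fields": [
        {"name": "filename", "type": "string"},
        {"name": "content_type", "type": "string"},
        {"name": "data", "type": "bytes"},  # raw image bytes
    ],
}


def schema_json() -> str:
    """Serialize the schema to the JSON text that would go in an .avsc file."""
    return json.dumps(IMAGE_SCHEMA, indent=2)


def make_record(filename: str, content_type: str, data: bytes) -> dict:
    """Build a plain dict whose keys match the schema's field names."""
    return {"filename": filename, "content_type": content_type, "data": data}


if __name__ == "__main__":
    print(schema_json())
```

An Avro library for Python could then take a schema like this and write records such as `make_record("cat.png", "image/png", raw_bytes)` into a container file on HDFS. Which library to use, and how the files are laid out, is left open here.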