After several months of experience with both NFS and HDFS, I can now answer my own question:
NFS lets you view and modify files on a remote machine as if they were stored locally. HDFS can do this too, but unlike NFS it is distributed: files are split into blocks that are replicated across several nodes, which is what makes it fault tolerant and scalable.
The advantage of NFS is that it is easy to set up, so I would probably use it for QA environments or small clusters. The advantage of HDFS is, of course, its fault tolerance, but the biggest advantage, IMHO, is data locality: when HDFS is co-located with the Spark nodes, tasks can be scheduled on the nodes that already hold the data, which gives better performance for checkpoints, shuffling, etc.
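To make the locality point concrete, here is a toy sketch in plain Python. It is not Spark's actual scheduler; the function `pick_executor`, the host names, and the block IDs are all made up for illustration. It only shows the idea: with HDFS on the same nodes as the executors, a task can usually run where a replica of its block lives, while with a separate NFS server every read goes over the network.

```python
from typing import Dict, List, Tuple


def pick_executor(replica_hosts: List[str], executors: List[str]) -> Tuple[str, bool]:
    """Return (executor_host, is_local) for one block or file partition.

    Hypothetical helper: prefer an executor that already holds a replica,
    otherwise fall back to the first executor and read over the network.
    """
    for host in executors:
        if host in replica_hosts:
            return host, True
    return executors[0], False


# Executors run on these (made-up) worker nodes.
executors = ["node3", "node4"]

# HDFS co-located with Spark: each block is replicated on worker nodes.
hdfs_blocks: Dict[str, List[str]] = {
    "blk_1": ["node1", "node2", "node3"],
    "blk_2": ["node2", "node3", "node4"],
}
hdfs_plan = {blk: pick_executor(hosts, executors) for blk, hosts in hdfs_blocks.items()}

# NFS: all data sits on one file server that runs no executors.
nfs_parts: Dict[str, List[str]] = {
    "part_1": ["nfs-server"],
    "part_2": ["nfs-server"],
}
nfs_plan = {part: pick_executor(hosts, executors) for part, hosts in nfs_parts.items()}

for name, plan in (("HDFS", hdfs_plan), ("NFS", nfs_plan)):
    local = sum(1 for _, is_local in plan.values() if is_local)
    print(f"{name}: {local}/{len(plan)} partitions read locally")
```

In this toy setup every HDFS block finds a local executor, while no NFS partition does, so all NFS reads cross the network to a single server.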
kerkero