Bush files on hdf pages that are not deleted when the management table is deleted (not external) are deleted

When I delete a managed table from the Hive interactive command line, the base files created in hdfs in /user/hive/warehouse/<databasename>.dbstill exist. This causes problems when I recreate a table with the same name and try to execute

INSERT INTO TABLE 

since it still contains the data that I loaded into these sections (dt and hr sections in my case) in my original order. Only if I use

INSERT OVERWRITE TABLE

then it will finally load the data correctly, but my ETL should use INSERT INTO TABLE.

Any ideas? I'm only ready to create the same table, but with a different name, or just go in and delete the material in hdfs, but I'm worried if this breaks the metastor or something like that. Lastly, I'm sure this is a managed table, not an external one.

+4
source share
1 answer

Sometimes Hive deletes table metadata, but silently moves files to the trash. Have you checked permissions on /user/<user>/.Trash? Verify that the ETL user has the correct permission for this folder.

+2
source

All Articles