针对HIVE的自定义记录分隔符(Custom Record Delimiter for HIVE)
对于Hive版本 - 0.14我们可以提供自定义记录分隔符“\ r \ n \ n \ n”而不是默认值' [ "\r" , "\n", "\r\n" ]
因此,在我的情况下,由于默认行分隔符,2行在HIVE中变为4行,而我需要“\ r \ n \ n \ n”作为行分隔符。
For Hive version - 0.14 Can we provide a custom record delimiter "\r\r\n" instead of defaults ' [ "\r" , "\n", "\r\n" ]
As a result, in my case 2 lines become 4 lines in HIVE because of default line separators whereas I needed "\r\r\n" to be line separator.
最满意答案
虽然有自定义字段分隔符org.apache.pig.piggybank.storage.MyRegExLoader,但是对于自定义记录分隔符,使用PIG将换行符转换为null并使用换行符作为记录分隔符
Though there is custom field delimiter org.apache.pig.piggybank.storage.MyRegExLoader , for custom record delimiter converted newlines to null using PIG and used newline as record delimiter
更多推荐
发布评论