join on different data type
2015-05-06 15:06
330 查看
SELECT ....
FROM A LEFT SEMI JOIN B
ON (A.col1 = B.col2)
WHERE ...
"
If A.col1 is of DOUBLE type, but B.col2 is of BIGINT, will print WARNING: Comparing a bigint and a double may result in a loss of precision. Why can't cast col2 to double automatically?
> If A.col1 is of DOUBLE type,
> but B.col2 is of BIGINT,
The automatic conversion is not acceptable according to the java language
spec (section 5.1.2)
https://docs.oracle.com/javase/specs/jls/se7/html/jls-5.html#jls-5.1.2
Also to be noted here is that in general, that even if you cast, you might
be casting the wrong way around.
Because joins on double columns will give incorrect (rather unintended,
but IEEE 754 correct) results when comparing byte serialized
representations - because of the nearly-equal property epsilon.
Easiest way to demonstrate this is to try the simplest off-by-epsilon case
(say, in python)
>>> import sys
>>> 0.1 + 0.2
0.30000000000000004
>>> 0.1 + 0.2 > 0.3
True
>>>
>>> ((0.1+0.2) - 0.3) < sys.float_info.epsilon
True
So if the RHS produced ETL values by sum() and the LHS was produced by
parsing log text, the JOIN will output zero rows.
If you want to do equijoins like that, the only valid case is to cast both
to fixed precision bigints (say, convert all dollars to cents, by *100),
not both to double.
Cheers,
Gopal
FROM A LEFT SEMI JOIN B
ON (A.col1 = B.col2)
WHERE ...
"
If A.col1 is of DOUBLE type, but B.col2 is of BIGINT, will print WARNING: Comparing a bigint and a double may result in a loss of precision. Why can't cast col2 to double automatically?
> If A.col1 is of DOUBLE type,
> but B.col2 is of BIGINT,
The automatic conversion is not acceptable according to the java language
spec (section 5.1.2)
https://docs.oracle.com/javase/specs/jls/se7/html/jls-5.html#jls-5.1.2
Also to be noted here is that in general, that even if you cast, you might
be casting the wrong way around.
Because joins on double columns will give incorrect (rather unintended,
but IEEE 754 correct) results when comparing byte serialized
representations - because of the nearly-equal property epsilon.
Easiest way to demonstrate this is to try the simplest off-by-epsilon case
(say, in python)
>>> import sys
>>> 0.1 + 0.2
0.30000000000000004
>>> 0.1 + 0.2 > 0.3
True
>>>
>>> ((0.1+0.2) - 0.3) < sys.float_info.epsilon
True
So if the RHS produced ETL values by sum() and the LHS was produced by
parsing log text, the JOIN will output zero rows.
If you want to do equijoins like that, the only valid case is to cast both
to fixed precision bigints (say, convert all dollars to cents, by *100),
not both to double.
Cheers,
Gopal
相关文章推荐
- mount: wrong fs type, bad option, bad superblock on 125.64.41.244:/data/img
- ajax上传图片TypeError: 'append' called on an object that does not implement interface FormData.
- different user control type: reuse layout vs reuse data
- EF6 Create Different DataContext on runtime(运行时改变连接字符串)
- Practices on Umbraco DataType Development
- LinkageError之loader (instance of xxx) previously initiated loading for a different type with name "lib/MyData"
- join......on 后面的and 和where的区别
- oracle11g ORA-01555 ON ACTIVE DATA GUARD
- 再说WCF Data Contract KnownTypeAttribute
- enctype="multipart/form-data"的表单无法获取表单中除了type=file以外的其他参数 commons-fileupload 获取除file外其他参数
- How to tell RNA-seq library type of strand-specific for RNA-seq data (for reads mapping by Tophat)
- Update-TypeData 帮助信息
- 关于input标签带有enctype="multipart/form-data"而导致getParameter获取不到值的解决方法2
- Description Resource Path Location Type web.xml is missing and <failOnMissingWebXml> is set to true
- javax.el.PropertyNotFoundException: Property 'title' not found on type java.lang.String
- sql语法:inner join on, left join on, right join on详细使用方法
- inner join on, left join on, right join on
- jQuery dataType指定为json的问题
- inner join on, left join on, right join on
- XML Data Type Methods(一)