[SPARK-10914] UnsafeRow serialization breaks when two machines have different Oops size. - spark

diff options

author	Reynold Xin <rxin@databricks.com>	2015-10-08 17:25:14 -0700
committer	Reynold Xin <rxin@databricks.com>	2015-10-08 17:25:14 -0700
commit	84ea287178247c163226e835490c9c70b17d8d3b (patch)
tree	1e01cf3ce4db65842b0685d55e954e089a8ddf68 /python
parent	02149ff08eed3745086589a047adbce9a580389f (diff)
download	spark-84ea287178247c163226e835490c9c70b17d8d3b.tar.gz spark-84ea287178247c163226e835490c9c70b17d8d3b.tar.bz2 spark-84ea287178247c163226e835490c9c70b17d8d3b.zip

[SPARK-10914] UnsafeRow serialization breaks when two machines have different Oops size.

UnsafeRow contains 3 pieces of information when pointing to some data in memory (an object, a base offset, and length). When the row is serialized with Java/Kryo serialization, the object layout in memory can change if two machines have different pointer width (Oops in JVM). To reproduce, launch Spark using MASTER=local-cluster[2,1,1024] bin/spark-shell --conf "spark.executor.extraJavaOptions=-XX:-UseCompressedOops" And then run the following scala> sql("select 1 xx").collect() Author: Reynold Xin <rxin@databricks.com> Closes #9030 from rxin/SPARK-10914.

Diffstat (limited to 'python')

0 files changed, 0 insertions, 0 deletions


context:
space:
mode: