Is there any plan to port TensorFlowOnSpark (from Yahoo)? #52

Open
@allwefantasy

Description

TensorFlowOnSpark supports truly distributed training with TensorFlow. It might be a good idea to port TensorFlowOnSpark to the spark-deep-learning project.

Activity

sueann commented on Sep 28, 2017

@sueann
Collaborator

Thanks, @allwefantasy, that makes a lot of sense. In fact, we've been collaborating with the TensorFlowOnSpark (TFoS) team to see how we can bring all these ideas together...! They have implemented a Spark MLlib Estimator in the TFoS project (https://github.com/yahoo/TensorFlowOnSpark/blob/master/tensorflowonspark/pipeline.py#L214) which is very exciting. The two projects are quite complementary currently, and it could make sense to combine them together. What are your thoughts?

allwefantasy commented on Sep 29, 2017

@allwefantasy
Author

Excellent, that's exactly what I want. Also, the TFEstimator implemented by TFoS shows that TFoS can easily serve as a distributed runtime for spark-deep-learning. That means we can simply treat TFoS as a pip dependency.
I checked, and the latest PR in spark-deep-learning is KerasImageFileEstimator. We could add a new param, e.g. distribute_mode, to make it run on TFoS.

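# Proposed pseudocode: a new distribute-mode setting selects TFoS as the training runtime.
class TFImageFileEstimator(Estimator, HasInputCol, HasInputImageNodeName,
                           HasOutputCol, HasOutputNodeName, HasLabelCol,
                           HasKerasModel, HasKerasOptimizer, HasKerasLoss,
                           CanLoadImage, HasOutputMode):
    distribute_mode = "TFoS"  # hypothetical setting, named after the param suggested above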

spark-deep-learning depends on tensorframes, which lets Spark use TensorFlow's numerical computation and ties TensorFlow and Spark closely together, so we can give users an experience like MLlib's. We shouldn't encourage people to write complex TensorFlow code. People should just define their network structure, or tell us their model file and its inputs/outputs, and then everything else should be done by spark-deep-learning.
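
To make the distribute_mode idea a bit more concrete, here is a minimal pure-Python sketch of how such a param could select a training backend. It does not use the sparkdl or TFoS APIs: the class name ImageFileEstimatorSketch, the backend functions, and the "local" mode are all hypothetical names made up for illustration, and the backends only return a description string instead of training anything.

```python
from typing import Callable, Dict, List


def _fit_local(image_uris: List[str]) -> str:
    # Hypothetical stand-in for single-node training.
    return "local: trained on %d images" % len(image_uris)


def _fit_tfos(image_uris: List[str]) -> str:
    # Hypothetical stand-in; a real version would hand the job to TensorFlowOnSpark.
    return "TFoS: trained on %d images" % len(image_uris)


# Maps each supported distribute_mode value to the function that runs training.
_BACKENDS: Dict[str, Callable[[List[str]], str]] = {
    "local": _fit_local,
    "TFoS": _fit_tfos,
}


class ImageFileEstimatorSketch:
    """Illustrative only: an estimator-like class with a distribute_mode param."""

    def __init__(self, distribute_mode: str = "local") -> None:
        if distribute_mode not in _BACKENDS:
            raise ValueError("unknown distribute_mode: %r" % distribute_mode)
        self.distribute_mode = distribute_mode

    def fit(self, image_uris: List[str]) -> str:
        # Dispatch to the backend chosen at construction time.
        return _BACKENDS[self.distribute_mode](image_uris)
```

With this shape, the user-facing call stays the same whichever runtime is chosen, e.g. ImageFileEstimatorSketch(distribute_mode="TFoS").fit(uris), which is the MLlib-like experience described above.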

Participants: @allwefantasy, @sueann, @phi-dbq

Issue #52 · databricks/spark-deep-learning