Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there any plan to port TensorframeOnSpark(From yahoo) ? #52

Open
allwefantasy opened this issue Sep 28, 2017 · 2 comments
Open

Is there any plan to port TensorframeOnSpark(From yahoo) ? #52

allwefantasy opened this issue Sep 28, 2017 · 2 comments

Comments

@allwefantasy
Copy link

TensorframeOnSpark support the real distribute trainning on tensorflow . Maybe it's a good idea to port TensorframeOnSpark to spark deep learning project.

@sueann
Copy link
Collaborator

sueann commented Sep 28, 2017

Thanks, @allwefantasy, that makes a lot of sense. In fact, we've been collaborating with the TensorFlowOnSpark (TFoS) team to see how we can bring all these ideas together...! They have implemented a Spark MLlib Estimator in the TFoS project (https://github.com/yahoo/TensorFlowOnSpark/blob/master/tensorflowonspark/pipeline.py#L214) which is very exciting. The two projects are quite complementary currently, and it could make sense to combine them together. What are your thoughts?

@allwefantasy
Copy link
Author

Excellent, that's really what I want. Also TFEstimator implemented by TFoS proves it is easy to be a distributed runtime for spark-deep-learning. That means we can just treat TFoS as pip dependency.
I check the last PR in spark-deep-learning is KerasImageFileEstimator, we can add a new param eg. distribute_mode to make it run in TFoS.

class TFImageFileEstimator(Estimator, HasInputCol, HasInputImageNodeName,
                              HasOutputCol, HasOutputNodeName, HasLabelCol,
                              HasKerasModel, HasKerasOptimizer, HasKerasLoss,
                              CanLoadImage, HasOutputMode, DistributeMode="TFoS"):

spark-deep-learning depends on tensorframes which makes spark can use tf's power of numerical calculation and connects tf / spark strongly, and we can provide users with a perfect experience like MLlib. we don't encourage people to write complex TFcode . People should just define their network structure or tell us their model file and input/output,then everything should been done by spark-deep-learning.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants