Well, the missing profile features in Jupyter also puzzled me in the past, although for a different reason - I wanted to be able to switch between different deep learning systems (Theano and TensorFlow) on demand; I eventually found a solution (described in my blog post here ).
The fact is that although there are no profiles in Jupyter, the startup files for the IPython kernel still exist, and since Pyspark uses this particular kernel, it can be used in your case.
, Pyspark Jupyter, , , script init_spark.py :
from pyspark import SparkConf, SparkContext
conf = SparkConf().setMaster("yarn-client")
sc = SparkContext(conf = conf)
~/.ipython/profile_default/startup/ .
, sc Jupyter:
In [1]: sc
Out[1]:<pyspark.context.SparkContext at 0x7fcceb7c5fd0>
In [2]: sc.version
Out[2]: u'2.0.0'
Apache Toree ( ), ( , ).