Closed scarvajalg closed 2 years ago
Hey @scarvajalg, thank you for reporting this.
Taking a quick look at the error messages, there seems to be something amiss with sqlite, and therefore it could be something to do with experiment tracking. Are you trying to enable that feature in Kedro-Viz?
If so, have you followed the usage instructions in the readme in order to enable it?
Hi!
I fix the error with sqlite. Now kedro-viz works but I can not visualize the pipelines:
Kedro-viz is empty. Why kedro-viz can't recognize the datasets and pipelines?
2022-02-23 09:59:39,487 - numexpr.utils - INFO - NumExpr defaulting to 4 threads.
Using Spark's default log4j profile: org/apache/spark/log4j-defaults.properties
Setting default log level to "WARN".
To adjust logging level use sc.setLogLevel(newLevel). For SparkR, use setLogLevel(newLevel).
22/02/23 09:59:53 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
2022-02-23 10:00:05,973 - kedro_viz.integrations.pypi - INFO - Checking for update...
2022-02-23 10:00:06,580 - kedro.framework.session.store - INFO - `read()` not implemented for `BaseSessionStore`. Assuming empty store.
/Users/scarvajalg/PycharmProjects/data_engineering/venv/lib/python3.8/site-packages/kedro/framework/context/context.py:32: DeprecationWarning: Accessing package_name via the context will be deprecated in Kedro 0.18.0.
warn(
/Users/scarvajalg/PycharmProjects/data_engineering/venv/lib/python3.8/site-packages/kedro/framework/context/context.py:488: UserWarning: Credentials not found in your Kedro project config.
No files found in ['/Users/scarvajalg/PycharmProjects/data_engineering/data-pipelines/data_modeling/conf/base', '/Users/scarvajalg/PycharmProjects/data_engineering/data-pipelines/data_modeling/conf/local'] matching the glob pattern(s): ['credentials*', 'credentials*/**', '**/credentials*']
warn(f"Credentials not found in your Kedro project config.\n{str(exc)}")
/Users/scarvajalg/PycharmProjects/data_engineering/venv/lib/python3.8/site-packages/hdfs/config.py:15: DeprecationWarning: the imp module is deprecated in favour of importlib; see the module's documentation for alternative uses
from imp import load_source
/Users/scarvajalg/PycharmProjects/data_engineering/data-pipelines/data_modeling/src/data_modeling/pipelines/analytics/mod_prop_veh/premaster/nodes.py:229: DeprecationWarning: invalid escape sequence \]
(F.split(F.split("rango_monto", "[<\]]")[1], ",")[0]).alias("lower_limit"),
/Users/scarvajalg/PycharmProjects/data_engineering/data-pipelines/data_modeling/src/data_modeling/pipelines/analytics/mod_prop_veh/premaster/nodes.py:230: DeprecationWarning: invalid escape sequence \]
(F.split(F.split("rango_monto", "[<\]]")[1], ",")[1]).alias("high_limit"),
/Users/scarvajalg/PycharmProjects/data_engineering/data-pipelines/data_modeling/src/data_modeling/pipelines/analytics/mod_prop_veh/premaster/nodes.py:262: DeprecationWarning: invalid escape sequence \]
(F.split(F.split("rango_monto", "[<\]]")[1], ",")[0]).alias("lower_limit"),
/Users/scarvajalg/PycharmProjects/data_engineering/data-pipelines/data_modeling/src/data_modeling/pipelines/analytics/mod_prop_veh/premaster/nodes.py:263: DeprecationWarning: invalid escape sequence \]
(F.split(F.split("rango_monto", "[<\]]")[1], ",")[1]).alias("high_limit"),
INFO: Started server process [4621]
2022-02-23 10:00:16,828 - uvicorn.error - INFO - Started server process [4621]
INFO: Waiting for application startup.
2022-02-23 10:00:16,832 - uvicorn.error - INFO - Waiting for application startup.
INFO: Application startup complete.
2022-02-23 10:00:16,836 - uvicorn.error - INFO - Application startup complete.
INFO: Uvicorn running on http://127.0.0.1:4141 (Press CTRL+C to quit)
2022-02-23 10:00:16,843 - uvicorn.error - INFO - Uvicorn running on http://127.0.0.1:4141 (Press CTRL+C to quit)
INFO: 127.0.0.1:49466 - "GET / HTTP/1.1" 200 OK
INFO: 127.0.0.1:49466 - "GET /static/css/2.8509df91.chunk.css HTTP/1.1" 200 OK
INFO: 127.0.0.1:49468 - "GET /static/js/2.610eb610.chunk.js HTTP/1.1" 200 OK
INFO: 127.0.0.1:49467 - "GET /static/css/main.680a8f3c.chunk.css HTTP/1.1" 200 OK
INFO: 127.0.0.1:49469 - "GET /static/js/main.bcd1329b.chunk.js HTTP/1.1" 200 OK
INFO: ('127.0.0.1', 49470) - "WebSocket /graphql" [accepted]
2022-02-23 10:00:19,083 - uvicorn.error - INFO - ('127.0.0.1', 49470) - "WebSocket /graphql" [accepted]
INFO: 127.0.0.1:49468 - "GET /api/main HTTP/1.1" 200 OK
INFO: 127.0.0.1:49471 - "GET /manifest.json HTTP/1.1" 404 Not Found
INFO: 127.0.0.1:49468 - "GET /favicon.ico HTTP/1.1" 404 Not Found
Hi @scarvajalg, What happens when you open http://localhost:4141/api/main -- do you see a json with nodes, pipelines. Please do share a screenshot.
Hi! This is the result:
Hi @scarvajalg, thank you for the screenshot!
I observe that you don't seem to have any nodes, edges, or layers in your screenshot. @AntonyMilneQB do you have any thoughts on why these arrays would be empty?
@tynandebold I wonder if this is the same issue we fixed in https://github.com/kedro-org/kedro-viz/pull/729 (but isn't released yet).
@scarvajalg what command exactly are you running to start kedro viz
? If you're using the --pipeline
flag, does it work without that, i.e. just a pure kedro viz
?
I executed both kedro viz
and kedro viz --pipeline
and I got the same result, an empty visualization
Hmm, very weird. What is the output of kedro registry list
? And what does your pipeline_registry.py
look like?
This is the output of kedro registry list
pipeline_registry.py
:
def register_pipelines() -> Dict[str, Pipeline]:
"""Register the project's pipelines.
Returns:
A mapping from a pipeline name to a ``Pipeline`` object.
"""
ddv_master = premaster_pipeline() + sunarp_pipeline() + master_pipeline()
udv_pacifico = (
bienes_generales_pipeline() +
persona_pipeline() +
poliza_pipeline() +
producto_pipeline() +
referencia_pipeline() +
siniestro_pipeline()
)
udv_bcp = data_bcp_pipeline()
return {
"__default__": Pipeline([]),
"bienes_generales": bienes_generales_pipeline(),
"great_expectations": great_expectations_pipeline(),
"data_bcp": data_bcp_pipeline(),
"ddv_master": ddv_master,
"dom_referencia": dom_referencia_pipeline(),
"master": master_pipeline(),
"dom_persona": dom_persona_pipeline(),
"dom_poliza": dom_poliza_pipeline(),
"persona": persona_pipeline(),
"poliza": poliza_pipeline(),
"producto": producto_pipeline(),
"premaster": premaster_pipeline(),
"referencia": referencia_pipeline(),
"siniestro": siniestro_pipeline(),
"sunarp": sunarp_pipeline(),
"udv_pacifico": udv_pacifico,
"udv_bcp": udv_bcp,
"um_score_buro": um_score_buro_pipeline(),
"universal": udv_pacifico + udv_bcp,
}
Ah ok, I think the problem is going to be that you have an empty __default__
pipeline. I suspect that this will be fixed when https://github.com/kedro-org/kedro-viz/pull/729 is released.
For the time being, I think the easiest fix is to populate __default__
with something, e.g. "__default__": bienes_generales_pipeline()
. Or probably even just a completely fake pipeline like "__default__": Pipeline([node(lambda: None, None, "x")])
should fix it I think.
As you suggest, I populate __default__
with a fake pipeline.
Now, when I execute kedro viz I can see just one pipeline (persona pipeline
) but I can't visualize the other pipelines or all of them. I tried kedro viz --pipeline
to visualize another pipeline, and it's empty. It seems that now kedro viz only recognize persona pipeline.
What I should do to visualize all the pipelines?
Hmm, this is weird. Please could you try running the following to install a version of kedro-viz that includes the fix https://github.com/kedro-org/kedro-viz/pull/729.
pip uninstall kedro-viz
pip install https://github.com/kedro-org/kedro-viz/raw/test/main-package/package/dist/kedro_viz-4.3.1-py3-none-any.whl
I tried with the kedro viz version you sent but the result is the same. I only can visualize persona pipeline
I tried kedro viz --pipeline
and I can visualize the pipeline I want to. But I can't visualize all the pipelines of the project. My project is big, there is a limit of pipelines or items that kedro viz can create in the visualization? Maybe that's the problem I can't visualize all the pipelines.
If possible, could you please create a repo and share the link with us? That'll help us find a solution faster.
Hi @scarvajalg. I'm wondering if you're still having issues with this?
Hi, I still have the problem but is not possible to share the project repo with the team. I can visualize each pipeline individually so for now it works for the team I'm working
Got it, thank you for the update. Would it be possible for you to create a separate, simplified repository that demos the same problem you're facing?
I have got the same issue. All my pipelines are empty.
kedro viz --pipeline history
2022-03-22 10:49:34,616 - kedro_viz.integrations.pypi - INFO - Checking for update...
2022-03-22 10:49:35,309 - kedro.framework.session.store - INFO - `read()` not implemented for `BaseSessionStore`. Assuming empty store.
2022-03-22 10:49:35,334 - kedro.config.config - INFO - Config from path `/home/jj/Git/etl-projects/conf/local` will override the following existing top-level config keys: machine_input
2022-03-22 10:49:36,191 - kedro.config.config - INFO - Config from path `/home/jj/Git/etl-projects/conf/local` will override the following existing top-level config keys: store_machine
INFO: Started server process [114880]
2022-03-22 10:49:37,435 - uvicorn.error - INFO - Started server process [114880]
INFO: Waiting for application startup.
2022-03-22 10:49:37,436 - uvicorn.error - INFO - Waiting for application startup.
INFO: Application startup complete.
2022-03-22 10:49:37,437 - uvicorn.error - INFO - Application startup complete.
INFO: Uvicorn running on http://127.0.0.1:4141 (Press CTRL+C to quit)
2022-03-22 10:49:37,437 - uvicorn.error - INFO - Uvicorn running on http://127.0.0.1:4141 (Press CTRL+C to quit)
(google-chrome:115008): Gtk-WARNING **: 10:49:37.672: Theme parsing error: gtk.css:5822:26: '-shadow' is not a valid color name
(google-chrome:115008): Gtk-WARNING **: 10:49:37.673: Theme parsing error: gtk.css:5825:14: not a number
(google-chrome:115008): Gtk-WARNING **: 10:49:37.673: Theme parsing error: gtk.css:5826:13: not a number
(google-chrome:115008): Gtk-WARNING **: 10:49:37.673: Theme parsing error: gtk.css:5827:11: Expected a length
Opening in existing browser session.
INFO: 127.0.0.1:33666 - "GET / HTTP/1.1" 200 OK
[115049:115049:0100/000000.879141:ERROR:sandbox_linux.cc(377)] InitializeSandbox() called with multiple threads in process gpu-process.
INFO: ('127.0.0.1', 33670) - "WebSocket /graphql" [accepted]
2022-03-22 10:49:38,084 - uvicorn.error - INFO - ('127.0.0.1', 33670) - "WebSocket /graphql" [accepted]
INFO: 127.0.0.1:33666 - "GET /api/main HTTP/1.1" 200 OK
INFO: 127.0.0.1:33666 - "GET /favicon.ico HTTP/1.1" 404 Not Found
INFO: 127.0.0.1:33672 - "GET /manifest.json HTTP/1.1" 404 Not Found
INFO: ('127.0.0.1', 33674) - "WebSocket /graphql" [accepted]
2022-03-22 10:49:39,240 - uvicorn.error - INFO - ('127.0.0.1', 33674) - "WebSocket /graphql" [accepted]
INFO: ('127.0.0.1', 33676) - "WebSocket /graphql" [accepted]
2022-03-22 10:49:39,511 - uvicorn.error - INFO - ('127.0.0.1', 33676) - "WebSocket /graphql" [accepted]
INFO: ('127.0.0.1', 33678) - "WebSocket /graphql" [accepted]
2022-03-22 10:49:40,252 - uvicorn.error - INFO - ('127.0.0.1', 33678) - "WebSocket /graphql" [accepted]
pip list | grep kedro
kedro 0.17.7
kedro-viz 4.3.1
kedro registry list
- __default__
- api
- erp
- history
- machine
- report
Hi @jw-cpnet, please could you try running the following and see if that fixes it? Also, what happens if you run kedro viz
with no pipeline
argument?
pip uninstall kedro-viz
pip install https://github.com/kedro-org/kedro-viz/raw/test/main-package/package/dist/kedro_viz-4.3.1-py3-none-any.whl
Thank you! @AntonyMilneQB
The fix works perfectly! And yes, kedro viz (before fixing) without pipeline argument works too.
Excellent, thanks very much for letting us know. This fix will be included in the next release of kedro-viz, so when that is released you should just be able to pip install kedro-viz==4.3.2
and all will be good.
It looks like this issue is resolved! Thanks for raising it @jw-cpnet and have an awesome day! 🚀
Description
When I run kedro-viz from the command line, I get an error. I work with pyspark and python
Context
Can't visualize kedro pipelines and datasets graph
Steps to Reproduce
I tried with two different kedro versions 4.0.0 and 4.3.1
Expected Result
Visualize the pipelines graph in the browser
Actual Result
I get the following errors
Your Environment
Include as many relevant details as possible about the environment you experienced the bug in:
Checklist