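I am doing hyperparameter tuning for a NeuralProphet model (you can see the parameters and the training code in the image).
When I try to parallelize the training, it gives me a permission error.
Why can't I access the folder "/databricks/spark/work/*"? Do I need some additional write permissions on the cluster?
I am also leaving the error traceback below. Thank you!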
Traceback (most recent call last):
  File "/databricks/spark/python/pyspark/worker.py", line 876, in main
    process()
  File "/databricks/spark/python/pyspark/worker.py", line 868, in process
    serializer.dump_stream(out_iter, outfile)
  File "/databricks/spark/python/pyspark/serializers.py", line 325, in dump_stream
    vs = list(itertools.islice(iterator, batch))
  File "/databricks/spark/python/pyspark/util.py", line 84, in wrapper
    return f(*args, **kwargs)
  File "<command-3900447791436819>", line 4, in <lambda>
  File "<command-3900447791436817>", line 36, in neural_prophet_cv
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/neuralprophet/forecaster.py", line 795, in fit
    metrics_df = self._train(
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/neuralprophet/forecaster.py", line 2657, in _train
    self.trainer.fit(
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/mlflow/utils/autologging_utils/safety.py", line 555, in safe_patch_function
    patch_function(call_original, *args, **kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/mlflow/utils/autologging_utils/safety.py", line 254, in patch_with_managed_run
    result = patch_function(original, *args, **kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/mlflow/pytorch/_pytorch_autolog.py", line 370, in patched_fit
    result = original(self, *args, **kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/mlflow/utils/autologging_utils/safety.py", line 536, in call_original
    return call_original_fn_with_event_logging(_original_fn, og_args, og_kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/mlflow/utils/autologging_utils/safety.py", line 471, in call_original_fn_with_event_logging
    original_fn_result = original_fn(*og_args, **og_kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/mlflow/utils/autologging_utils/safety.py", line 533, in _original_fn
    original_result = original(*_og_args, **_og_kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/trainer/trainer.py", line 696, in fit
    self._call_and_handle_interrupt(
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/trainer/trainer.py", line 650, in _call_and_handle_interrupt
    return trainer_fn(*args, **kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/trainer/trainer.py", line 735, in _fit_impl
    results = self._run(model, ckpt_path=self.ckpt_path)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/trainer/trainer.py", line 1154, in _run
    self._log_hyperparams()
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/trainer/trainer.py", line 1222, in _log_hyperparams
    logger.log_hyperparams(hparams_initial)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/utilities/rank_zero.py", line 32, in wrapped_fn
    return fn(*args, **kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/loggers/tensorboard.py", line 211, in log_hyperparams
    self.log_metrics(metrics, 0)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/utilities/rank_zero.py", line 32, in wrapped_fn
    return fn(*args, **kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/neuralprophet/logger.py", line 29, in log_metrics
    super(MetricsLogger, self).log_metrics(metrics, step)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/utilities/rank_zero.py", line 32, in wrapped_fn
    return fn(*args, **kwargs)
  File "/local_disk0/.ephemeral_nfs/cluster_libraries/python/lib/python3.9/site-packages/pytorch_lightning/loggers/tensorboard.py", line 236, in log_metrics
    raise ValueError(m) from ex
ValueError: you tried to log -1 which is currently not supported. Try a dict or a scalar/tensor.
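The last frame of the traceback ends in `raise ValueError(m) from ex`, so the ValueError wraps an earlier exception, which may be where the permission error is hiding. A minimal sketch of how the chained exceptions could be listed to get at the root cause follows. The names `exception_chain` and `log_metric` and the path are hypothetical and only for illustration; `log_metric` is a stand-in for the failing logger call, not the actual library code.

```python
def exception_chain(exc):
    """Return the exceptions from the outermost one down to the root cause."""
    chain = []
    seen = set()
    while exc is not None and id(exc) not in seen:
        chain.append(exc)
        seen.add(id(exc))
        # __cause__ is set by "raise ... from ex"; __context__ by implicit chaining.
        exc = exc.__cause__ or exc.__context__
    return chain


def log_metric(value):
    # Hypothetical stand-in for a logger call that fails while writing to disk.
    try:
        raise PermissionError(13, "Permission denied", "/some/unwritable/dir")
    except Exception as ex:
        raise ValueError(
            f"you tried to log {value} which is currently not supported."
        ) from ex


try:
    log_metric(-1)
except ValueError as err:
    chain = exception_chain(err)
    for e in chain:
        # Print each exception type and message, outermost first.
        print(type(e).__name__, e)
```

Wrapping the training call inside the parallelized function in a similar `try`/`except` and printing the chain should show whether the underlying exception on the worker is really a `PermissionError` and which path it refers to.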
Hello. Thank you for your answer. I can manage the permissions.
Note: I am using Azure Databricks.