pycaret_predict: base_model_trainer.py comparison

comparison base_model_trainer.py @ 9:c6c1f8777aae draft

planemo upload for repository https://github.com/goeckslab/gleam commit 4a11e8a4c4e9daa884bddedfa47090476c517667

author	goeckslab
date	Thu, 31 Jul 2025 15:41:24 +0000
parents	1aed7d47c5ec
children	e2a6fed32d54

comparison


-:1aed7d47c5ec
+:c6c1f8777aae
 self.best_model = self.exp.tune_model(self.best_model)
 self.results = self.exp.pull()
 if self.task_type == "classification":
     self.results.rename(columns={"AUC": "ROC-AUC"}, inplace=True)
-_ = self.exp.predict_model(self.best_model)
+prob_thresh = getattr(self, "probability_threshold", None)
+if self.task_type == "classification" and prob_thresh is not None:
+    _ = self.exp.predict_model(self.best_model, probability_threshold=prob_thresh)
+else:
+    _ = self.exp.predict_model(self.best_model)
 self.test_result_df = self.exp.pull()
 if self.task_type == "classification":
     self.test_result_df.rename(columns={"AUC": "ROC-AUC"}, inplace=True)
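
Note on the hunk above: the change passes `probability_threshold` to `predict_model` only for classification tasks and only when a threshold is set. A minimal sketch of the same dispatch follows, assuming a hypothetical helper `predict_kwargs` and a stand-in `fake_predict_model`. Neither is part of the commit or of PyCaret; the stand-in exists only to make the sketch runnable without an experiment object.

```python
from typing import Any, Dict, Optional


def predict_kwargs(task_type: str,
                   probability_threshold: Optional[float] = None) -> Dict[str, Any]:
    """Hypothetical helper: extra keyword arguments for a predict_model-style call.

    Mirrors the condition in the diff: the threshold is forwarded only for
    classification and only when it is not None.
    """
    if task_type == "classification" and probability_threshold is not None:
        return {"probability_threshold": probability_threshold}
    return {}


def fake_predict_model(model: str, **kwargs: Any) -> Dict[str, Any]:
    """Stand-in for self.exp.predict_model; it only records its arguments."""
    return {"model": model, **kwargs}


# Classification with a threshold: the keyword is forwarded.
with_threshold = fake_predict_model("best_model", **predict_kwargs("classification", 0.3))

# Regression, or no threshold: the call is made without the keyword.
regression_call = fake_predict_model("best_model", **predict_kwargs("regression", 0.3))
no_threshold = fake_predict_model("best_model", **predict_kwargs("classification"))
```

Building the keyword arguments first keeps a single call site, at the cost of one extra helper; the committed code uses an explicit if/else instead, which is equally valid.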
 def save_model(self):
 best_model_name = str(self.results.iloc[0]["Model"])
 except Exception:
 best_model_name = type(self.best_model).__name__
 LOG.info(f"Best model determined as: {best_model_name}")
 # 2) Compute training sample count
 try:
     n_train = self.exp.X_train.shape[0]
 except Exception:
     n_train = getattr(self.exp, "X_train_transformed", pd.DataFrame()).shape[0]
 total_rows = self.data.shape[0]
 # 3) Build setup parameters table
-all_params = self.setup_params
+all_params = self.setup_params.copy()
+if self.task_type == "classification" and hasattr(self, "probability_threshold"):
+    all_params["probability_threshold"] = self.probability_threshold
 display_keys = [
     "Target",
     "Session ID",
     "Train Size",
     "Normalize",
     "Remove Outliers",
     "Remove Multicollinearity",
     "Polynomial Features",
     "Fix Imbalance",
     "Models",
+    "Probability Threshold",
 ]
 setup_rows = []
 for key in display_keys:
     pk = key.lower().replace(" ", "_")
     v = all_params.get(pk)
         dv = bool(v)
     elif key == "Cross Validation Folds":
         dv = v if v is not None else "None"
     elif key == "Models":
         dv = ", ".join(map(str, v)) if isinstance(v, (list, tuple)) else "None"
+    elif key == "Probability Threshold":
+        dv = v if v is not None else "None"
     else:
         dv = v if v is not None else "None"
     setup_rows.append([key, dv])
 if hasattr(self.exp, "_fold_metric"):
     setup_rows.append(["best_model_metric", self.exp._fold_metric])
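
Note on the hunk above: `self.setup_params.copy()` lets the report code add `probability_threshold` without mutating the original parameters, and each display key is mapped to a parameter key by lowercasing and replacing spaces with underscores. The sketch below illustrates both points with a hypothetical standalone function `build_setup_rows`; it is not the committed method. It covers only the "Models" and fallback branches, and it adds the threshold only when it is not None, whereas the commit checks `hasattr`.

```python
from typing import Any, Dict, List, Optional, Sequence


def build_setup_rows(setup_params: Dict[str, Any],
                     display_keys: Sequence[str],
                     task_type: str,
                     probability_threshold: Optional[float] = None) -> List[List[Any]]:
    """Hypothetical sketch of the setup-parameter table built in save_model."""
    # Shallow copy, so adding a key below does not change the caller's dict.
    all_params = dict(setup_params)
    # Assumption: add the threshold only when set (the commit uses hasattr).
    if task_type == "classification" and probability_threshold is not None:
        all_params["probability_threshold"] = probability_threshold

    rows: List[List[Any]] = []
    for key in display_keys:
        # "Probability Threshold" -> "probability_threshold"
        param_key = key.lower().replace(" ", "_")
        value = all_params.get(param_key)
        if key == "Models":
            shown = ", ".join(map(str, value)) if isinstance(value, (list, tuple)) else "None"
        else:
            shown = value if value is not None else "None"
        rows.append([key, shown])
    return rows


params = {"target": "label", "models": ["lr", "rf"]}
rows = build_setup_rows(
    params,
    ["Target", "Models", "Probability Threshold"],
    task_type="classification",
    probability_threshold=0.25,
)
```

In the diff, the dedicated "Probability Threshold" branch computes the same value as the final `else` branch, so the sketch folds it into the fallback.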
