Tags: valiant-sun/pai
Tags
[cherry pick to pai-1.4.y] sync master change (microsoft#5144) * Visualized mnist500task (microsoft#5131) * update marketplace image version to 1.3.0 (microsoft#5137) Co-authored-by: AmberMsy <[email protected]>
Release v1.3.0 * Marketplace - New templates in marketplace (microsoft/openpaimarketplace#60) * HiveD Scheduler - Support cluster autoscale with HiveD scheduler on AKS (microsoft#4868) - Support dynamic sku types for different vc on webportal (microsoft#4900) * Advanced job debug mode - Add per task retry history (microsoft/frameworkcontroller#62, microsoft#4958, microsoft#4966) - Expose Kubernetes events (microsoft#4939, microsoft#4975) * GPU monitoring and utilization - Support job tagging (microsoft#4924) - Stop low GPU utilization job with alert-manager (microsoft#4940) - Cordon node with GPU ECC Errors (microsoft#4942) * Documentation - Fix document according to DRI tickets (microsoft#4828) - Add distributed examples (microsoft#4821) * Webportal - Add help info for items on webportal (microsoft#4950)
Update release note 1.2.0 (microsoft#4928) * update release note v1.2.0 * update README * Add upgrade guide link to RELEASE_NOTE * Old framework retry history cannot be shown after upgrading to v1.2.0 * Add link for all new items
Cherry-pick - Fix setuptools version in dev-box (microsoft#4596) * fix * fix
PreviousNext