Closed phschimm closed 9 months ago
I've posted a more detailed investigation about this problem on StackOverflow: https://stackoverflow.com/questions/71053554/why-can-random-cut-forests-record-set-method-for-data-conversion-upload-not
Can someone maybe identify, which API version was used in this post here?
If I had that information, I could downgrade my notebook instance, execute my experiments, and get the quality metrics I need.
Hello, I am also experiencing this issue and wanted to know if there is any work around this problem?
Describe the bug The method
sagemaker.RandomCutForest.record_set()
can't be used to create aRecordSet
for the "test" channel of the RCF algorithm.To reproduce Configure a RandomCutForest estimator and try fitting it to data ingested via
record_set(..., channel='test')
:Expected behavior A
RecordSet
returned byrecord_set(..., channel='test')
should have"S3DataDistributionType": "FullyReplicated"
.Screenshots or logs
System information A description of your system. Please provide:
Additional context This property is hardcoded in the
RecordSet
class utilized byrecord_set()
:https://github.com/aws/sagemaker-python-sdk/blob/2ebba8a454de03a2bc49267c91dbacddd6183585/src/sagemaker/amazon/amazon_estimator.py#L340
@mufaddal-rohawala @jeniyat or anyone else: In the meantime, is there any other way to create a
RecordSet
for RCF from Numpy data?