scylladb / scylla-migrator

Migrate data extract using Spark to Scylla, normally from Cassandra
Apache License 2.0

The sandboxed testing environment cannot use AWS #113

Open julienrf opened 4 months ago

julienrf commented 4 months ago

In #107 we introduced a testing infrastructure that allows us to test several migration scenarios. Unfortunately, the streamChanges feature uses the spark-kinesis module under the hood, and that module makes calls to the real AWS servers instead of using the containerized service.

Possible solutions would be either to fix the code of spark-kinesis so that it stays within the sandbox environment (this is a known issue, see https://github.com/localstack/localstack/issues/677 and https://issues.apache.org/jira/browse/SPARK-27950), or to use something other than spark-kinesis.
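One way to keep spark-kinesis inside the sandbox would be to make the Kinesis endpoint configurable instead of hard-coded. A minimal sketch of the idea, not the module's actual code: the `KinesisEndpoint` object and `resolve` method are hypothetical names, and LocalStack's default edge port 4566 is an assumption about the sandbox setup.

```scala
// Hypothetical sketch: resolve the Kinesis endpoint from an optional
// sandbox override instead of always targeting the real AWS servers.
object KinesisEndpoint {
  // When a sandbox endpoint (e.g. LocalStack at http://localhost:4566)
  // is configured, use it; otherwise fall back to the regional AWS URL.
  def resolve(region: String, sandboxEndpoint: Option[String]): String =
    sandboxEndpoint.getOrElse(s"https://kinesis.$region.amazonaws.com")
}
```

With a hook like this, the test infrastructure could point the streamChanges scenarios at the containerized Kinesis while production runs keep the default AWS endpoint.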


julienrf commented 4 days ago

Commenting on this issue instead of creating a new one because this is related to the testing infrastructure.

Currently, our testing infrastructure recreates the AWS stack (S3 and DynamoDB) in Docker containers. This works okay but comes with limitations:

- the spark-kinesis module has a hard-coded dependency on the real AWS endpoints, so the streamChanges scenarios cannot run against the containerized services;
- a containerized implementation of AWS cannot tell us how the migrator behaves against the real service (for instance, for benchmarks).

While the first point could be (and should be, ideally) fixed by removing the hard-coded dependency on AWS, to address the second point we have no choice but to have tests that use the real AWS. And, in practice, fixing the first point would require changing our copy of the spark-kinesis project, which is undesirable: it is better to keep our copy as close as possible to the original so that we can merge upstream improvements into it.

I believe those points motivate the need for tests that use the real AWS instead of a containerized implementation of it. Except for the benchmarks, such tests should not be expensive, because they would not consume much bandwidth.
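If tests against the real AWS are added, they could be gated so that they only run when credentials are available, keeping the sandboxed suite self-contained for contributors without AWS access. A minimal sketch of that idea; the `RealAwsTests` object and the environment-variable convention are assumptions, not part of the project:

```scala
// Hypothetical sketch: only run real-AWS tests when credentials are
// present in the environment, so CI jobs without AWS access skip them.
object RealAwsTests {
  // AWS SDKs conventionally read these two variables for credentials.
  def enabled(env: Map[String, String]): Boolean =
    env.contains("AWS_ACCESS_KEY_ID") && env.contains("AWS_SECRET_ACCESS_KEY")
}
```

A test suite could call `RealAwsTests.enabled(sys.env)` and mark the real-AWS scenarios as ignored when it returns false.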

I propose the following course of action: