Apache Iceberg version
0.9.1 (latest release)
Please describe the bug 🐞
When calling count() on a DataScan, limit is not respected. Seems trivial but if I set a limit of 5 I expect 5 or less rows back, at least with a scan-like implementation
The underlying ArrowScan does not get passed the limit param
https://github.com/apache/iceberg-python/blob/main/pyiceberg/table/__init__.py#L1940
This results in scans taking longer due to not respecting the limit.
The fix will involve more than just passing the limit to the ArrowScan
Willingness to contribute