- Notifications
You must be signed in to change notification settings - Fork4.3k
Insights: apache/beam
Overview
Could not load contribution data
Please try again later
49 Pull requests merged by25 people
- Enable xlang access for SQS read functionality
#34327 merged
Mar 19, 2025 - Add option to disable Kafka metrics
#34303 merged
Mar 19, 2025 - [AnomalyDetection] Support offline detectors
#34311 merged
Mar 18, 2025 - Allow declaration of external dependencies for YAML UDFs.
#34073 merged
Mar 18, 2025 - Bump github.com/nats-io/nats.go from 1.39.0 to 1.39.1 in /sdks
#34029 merged
Mar 18, 2025 - Support directory separator for Python local filesystem
#34318 merged
Mar 18, 2025 - Bump golang.org/x/oauth2 from 0.26.0 to 0.28.0 in /sdks
#34190 merged
Mar 18, 2025 - [AnomalyDetection] Refactor and improve Specifiable
#34310 merged
Mar 18, 2025 - Expose add-modules JVM option in SDKHarnessOptions
#34289 merged
Mar 18, 2025 - [KafkaIO] Fix average record size data race and backlog estimation
#34165 merged
Mar 18, 2025 - Update roadmap page 28342
#34330 merged
Mar 18, 2025 - Bump golang.org/x/net from 0.35.0 to 0.37.0 in /sdks
#34328 merged
Mar 18, 2025 - Fix PostCommit Python ValidatesContainer with RC workflow
#34332 merged
Mar 18, 2025 - Fix Precommit SQL
#34320 merged
Mar 18, 2025 - added test_accessing_valueprovider_info_after_run
#34315 merged
Mar 18, 2025 - Consider workflow flaky if the 5 last runs were failed
#34321 merged
Mar 17, 2025 - Honor JAVA_HOME when it is set
#34313 merged
Mar 17, 2025 - Add Year and Apache Beam in Copyright
#34306 merged
Mar 15, 2025 - Add a STRING format to PubSub reading that interpretes the payload as utf-8 encoded.
#34301 merged
Mar 15, 2025 - Better error messages for unavailable providers.
#34304 merged
Mar 15, 2025 - refactor main.py so that it can be called by runner.run_async().
#34290 merged
Mar 14, 2025 - Fix pre/postprocess type hints
#34298 merged
Mar 14, 2025 - Healthcare Label Update
#34299 merged
Mar 14, 2025 - docs: fix typo in snippets_test.py
#34297 merged
Mar 14, 2025 - docs: update CHANGES.md for Spark Runner
#34295 merged
Mar 14, 2025 - add default port for HostAndPort instances used in Windmill
#34061 merged
Mar 14, 2025 - [AnomalyDetection] Support functions and classes as init arguments in specifiable.
#34273 merged
Mar 14, 2025 - Use Avro 1.8-compatible Schema constructors in Storage Write API translator
#34281 merged
Mar 14, 2025 - Updates 2.64.0 release notes to highlight recent additions to the Managed API
#34291 merged
Mar 13, 2025 - Use BoundedTrie metric to track lineage in IO
#33891 merged
Mar 13, 2025 - Update join beam doc
#34284 merged
Mar 13, 2025 - Spark Runner : Fix invalid translateImpulse
#34272 merged
Mar 13, 2025 - Eliminate springframework dependency in KafkaIO
#34278 merged
Mar 13, 2025 - Fix spotless
#34279 merged
Mar 13, 2025 - Fix Stress Tests Java Bigquery
#34078 merged
Mar 13, 2025 - Remove 2.1 and 2.2 and add 2.8 for Kafka compatibility test
#34265 merged
Mar 13, 2025 - Refactor per worker histogram metrics
#34244 merged
Mar 13, 2025 - Update code-change-guide.md
#34266 merged
Mar 12, 2025 - Fix PostCommit Python ValidatesContainer Dataflow With RC job
#34246 merged
Mar 12, 2025 - Update pinned kafka version to 3.9.0 in Expansion Service.
#34197 merged
Mar 12, 2025 - Update to latest bom
#34231 merged
Mar 12, 2025 - Fix PostCommit Java PVR Spark3 Streaming job
#34253 merged
Mar 12, 2025 - [Java] Added Metrics Configuration Support to Iceberg Data Writers
#34140 merged
Mar 12, 2025 - Add yaml examples generation to the release scripts.
#34217 merged
Mar 12, 2025 - Update DF Python container tag
#34255 merged
Mar 12, 2025 - Putting the default retry settings which was overridden in Dataflow templates
#34161 merged
Mar 12, 2025 - Use same format for next and current release versions
#34262 merged
Mar 12, 2025 - Bump dataflow java container version - 20250312
#34256 merged
Mar 12, 2025 - [AnomalyDetection] Refactor code and add more docstrings
#34235 merged
Mar 12, 2025
29 Pull requests opened by19 people
- [Java]Add Support for Named Schemas in SpannerIO.Write by Enhancing Schema Retrieval
#34261 opened
Mar 12, 2025 - [IcebergIO] Filter out data files that have already been committed
#34264 opened
Mar 12, 2025 - Bump golang.org/x/net from 0.23.0 to 0.36.0 in /learning/katas/go
#34267 opened
Mar 12, 2025 - Fix Republish Released Docker Images workflow
#34268 opened
Mar 13, 2025 - Making setServiceFactory public in SpannerConfig for Failure injection testing in Dataflow templates
#34271 opened
Mar 13, 2025 - Add sdf kafka poll latencies
#34275 opened
Mar 13, 2025 - Update docstring to clarify numShards=0 doesn't neccessarily enable autosharding
#34280 opened
Mar 13, 2025 - Fix programming guide python reference for combining pcollection into a single value
#34282 opened
Mar 13, 2025 - do not block other threads from health checking if a stream is blocked
#34283 opened
Mar 13, 2025 - [Dataflow Streaming] WindmillTimerInternals: Use a single map to store timer data + liveness
#34292 opened
Mar 14, 2025 - Bump google.golang.org/api from 0.221.0 to 0.226.0 in /sdks
#34293 opened
Mar 14, 2025 - Revert Skip BoundedTrie on Dataflow till service is have BoundedTrie …
#34294 opened
Mar 14, 2025 - [Java]Fix NPE in UnboundedSolaceReader.getWatermark during disconnection (fixes #32660)
#34296 opened
Mar 14, 2025 - Add STRING format to ReadFromKafka schema transform.
#34302 opened
Mar 14, 2025 - More precise binary operation inference.
#34305 opened
Mar 15, 2025 - [Java] Add Gauge Metric Extraction to DataflowMetrics
#34307 opened
Mar 15, 2025 - Add an experiment named enableLineageRollup which when passed to Java…
#34312 opened
Mar 16, 2025 - Fix typo in BigQuery Python Documentation
#34317 opened
Mar 16, 2025 - Add 3 examples in playground SQL transform, schema transform and Composite Combine
#34322 opened
Mar 17, 2025 - Aggregation option in Kinesis Writer Python sdk
#34323 opened
Mar 17, 2025 - Key by paneindex and reshuffle before loading files.
#34324 opened
Mar 17, 2025 - Update pypi documentation 30145
#34329 opened
Mar 18, 2025 - [KafkaIO] Improve caching in backlog estimation and processing
#34331 opened
Mar 18, 2025 - Kafka Consumer Group Properties
#34334 opened
Mar 18, 2025 - Remove cancelled tasks from ReadOperation queue when shutting down
#34335 opened
Mar 18, 2025 - Fix pandas doctests sensitive to NumpyExtensionArray formatting.
#34336 opened
Mar 18, 2025 - Bump cloud.google.com/go/storage from 1.50.0 to 1.51.0 in /sdks
#34340 opened
Mar 19, 2025 - Bump github.com/aws/aws-sdk-go-v2/service/s3 from 1.77.0 to 1.78.2 in /sdks
#34341 opened
Mar 19, 2025
30 Issues closed by7 people
- [Bug]: Python 3.12 in-compatibility of Apache Beam
#32617 closed
Mar 18, 2025 - [Bug]: apache-beam is unusable in recent python due to pinning an old dill library from 2019
#32842 closed
Mar 18, 2025 - [Feature Request]: Upgrade or provide ETA on dill
#22893 closed
Mar 18, 2025 - Python local filesystem match does not work without directory separator.
#19741 closed
Mar 18, 2025 - [Feature Request]: Enable to configure `--add-modules` jvm flag
#30281 closed
Mar 18, 2025 - [Bug]: Roadmap page is out of date.
#28342 closed
Mar 18, 2025 - [Bug]: beam_PreCommit_PythonDocker misconfigured running on snapshot container
#33558 closed
Mar 18, 2025 - The PostCommit Python ValidatesContainer Dataflow With RC job is flaky
#30525 closed
Mar 18, 2025 - The PreCommit SQL job is flaky
#34314 closed
Mar 18, 2025 - Add test for snippet accessing_valueprovider_info_after_run
#19731 closed
Mar 18, 2025 - The PostCommit Java ValidatesRunner Flink Java8 job is flaky
#32949 closed
Mar 17, 2025 - Prefer java binary from $JAVA_HOME in JavaJarServer
#21006 closed
Mar 17, 2025 - Remove deprecated 'compare' argument from combiners.Top in PyDocs
#19819 closed
Mar 16, 2025 - Update Python SDK example tests to use assert_that
#18029 closed
Mar 16, 2025 - Log GCS upload ID for Python GCS connector
#21577 closed
Mar 15, 2025 - The StressTests Java BigQueryIO job is flaky
#31968 closed
Mar 15, 2025 - [Bug]: Python doc misses Copyright
#34287 closed
Mar 15, 2025 - [Bug]: typo in docs - Python Composition with AfterWatermark example
#34224 closed
Mar 14, 2025 - Improve case studies section of website
#21281 closed
Mar 14, 2025 - The PostCommit Python job is flaky
#30513 closed
Mar 14, 2025 - Beam directRunner documentation java tab has python information
#19884 closed
Mar 14, 2025 - Add support for checkpointing in Spark streaming
#20426 closed
Mar 13, 2025 - [Task]: Update KafkaIO ConsumerSpEL and eliminate springframework dependencies
#34277 closed
Mar 13, 2025 - The Republish Released Docker Images job is flaky
#33834 closed
Mar 13, 2025 - The PostCommit Java PVR Spark3 Streaming job is flaky
#34207 closed
Mar 12, 2025 - [Feature Request]: [IcebergIO] Configure data writers to track metrics
#34112 closed
Mar 12, 2025 - [Failing Test]: dataflow runner worker project test stuck causing Java PreCommit time out
#28957 closed
Mar 12, 2025 - [Failing Test]: org.apache.beam.sdk.io.mqtt.MqttIOTest.testReadWithMetadata
#34175 closed
Mar 12, 2025
12 Issues opened by11 people
- [Feature Request]: Add support for tracking lineage with BoundedTrie
#34342 opened
Mar 19, 2025 - Performance Regression or Improvement: gbk_python_batch_load_test_2gb_of_10B_records:runtime
#34337 opened
Mar 18, 2025 - [Bug]: withOutputTags (aka side output) requires ParDo to run on each output separately
#34333 opened
Mar 18, 2025 - [Feature Request]: Enable AWS SQS reads for Python SDK
#34326 opened
Mar 17, 2025 - [Feature Request]: Record Aggregation option in Kinesis Writer
#34319 opened
Mar 17, 2025 - [Bug]: Go Prism Runner has a Data Race in v2.63.0
#34316 opened
Mar 16, 2025 - [Task]: Improve README.md for https://github.com/apache/beam-starter-java-provider
#34308 opened
Mar 15, 2025 - Big Query Python Documentation Typo
#34300 opened
Mar 14, 2025 - [Bug]: Unknown spanner type FLOAT32>(VECTOR_LENGTH=>128
#34276 opened
Mar 13, 2025 - [Bug]: Bigquery python streaming insertAll SSLError leads to stuck streaming job
#34270 opened
Mar 13, 2025
72 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- Add Triton Inference Server Support
#34252 commented on
Mar 18, 2025 • 16 new comments - Fail Fast if Resources Do Not Exist in Kafka Cluster
#34258 commented on
Mar 18, 2025 • 7 new comments - Add streaming example to AlloyDB colab.
#34251 commented on
Mar 12, 2025 • 3 new comments - Fix precommit flink container
#34250 commented on
Mar 19, 2025 • 3 new comments - add vendor to manually shutdown and restart GetWorkerMetadataStream to prevent DEADLINE_EXCEEDED errors
#34053 commented on
Mar 14, 2025 • 2 new comments - Fix: Replace deprecated DataFrame.applymap with map for compatibility
#33532 commented on
Mar 18, 2025 • 2 new comments - [Python] File staging to user worker support
#34208 commented on
Mar 13, 2025 • 2 new comments - Move ExecutionStateTracker.nextBundleLullDurationReportMs reset attempt to resolve race detection
#34214 commented on
Mar 18, 2025 • 2 new comments - Add Encryption When Writing to Iceberg Tables in RecordWriter.java
#34021 commented on
Mar 16, 2025 • 1 new comment - #34009 avro generic record to beam row conversion added support for a…
#34024 commented on
Mar 17, 2025 • 1 new comment - add equals hashCode to BoundedToUnboundedSourceAdapter
#34057 commented on
Mar 18, 2025 • 1 new comment - [Managed Iceberg] unbounded source
#33504 commented on
Mar 18, 2025 • 1 new comment - [DO NOT MERGE] Removed < 1.66 for grpcio
#34196 commented on
Mar 16, 2025 • 1 new comment - feat:large-row-skip-in-bigtable | added experimental options to skip …
#34245 commented on
Mar 13, 2025 • 1 new comment - Request to use BASIC enum when calling tables.get() in BigQuery #34075
#34249 commented on
Mar 18, 2025 • 0 new comments - [Python] Fix WriteToBigQuery transform using CopyJob does not work with WRITE_TRUNCATE write disposition (#34247)
#34248 commented on
Mar 18, 2025 • 0 new comments - Switch to use registerFileSystemsOnce for SerializablePipelineOptions constructor
#34028 commented on
Mar 18, 2025 • 0 new comments - Update allowed lateness in FixedWindows example to 2 days
#34227 commented on
Mar 18, 2025 • 0 new comments - [BEAM-6394] Add support to write protobuf data using ProtoParquetReader
#34063 commented on
Mar 15, 2025 • 0 new comments - Bump github.com/fsouza/fake-gcs-server from 1.52.1 to 1.52.2 in /sdks
#34094 commented on
Mar 18, 2025 • 0 new comments - Update MongoDB driver to mongodb-driver-legacy:5.3.1
#34100 commented on
Mar 18, 2025 • 0 new comments - [Prism] Refactor stageState to a behavior interface to reduce branch combinatorics
#34132 commented on
Mar 18, 2025 • 0 new comments - [Python] Add caching for BigQuery table definitions
#34135 commented on
Mar 18, 2025 • 0 new comments - Fix Docker build error by adding fallback for python3.12-distutils
#34144 commented on
Mar 18, 2025 • 0 new comments - test commit
#34148 commented on
Mar 17, 2025 • 0 new comments - Bump @octokit/plugin-paginate-rest and @octokit/rest in /scripts/ci/issue-report
#34167 commented on
Mar 18, 2025 • 0 new comments - Enable cloudpickle default
#34223 commented on
Mar 12, 2025 • 0 new comments - [Java] Add InsertRetryPolicy for non-successful BigQuery insertAll responses
#34222 commented on
Mar 19, 2025 • 0 new comments - Rethrowing Exception from CassandraIO's ReadFn
#34191 commented on
Mar 18, 2025 • 0 new comments - Fix ProtoCoder NoSuchMethodException
#34194 commented on
Mar 12, 2025 • 0 new comments - [Java] Ensure Pipeline Execution Requires Configuration Options or Logs Warning
#34220 commented on
Mar 17, 2025 • 0 new comments - [KafkaIO] Remove duplicate offset in range check
#34201 commented on
Mar 18, 2025 • 0 new comments - [KafkaIO] Update tracker and watermark for non-visible progress
#34202 commented on
Mar 18, 2025 • 0 new comments - Add Documentation Hint for Template Job Creation in DataflowRunner
#34204 commented on
Mar 18, 2025 • 0 new comments - Add support for top-level table properties table creation
#34205 commented on
Mar 16, 2025 • 0 new comments - [Java] Added tests for S3ReadableSeekableByteChannel
#34219 commented on
Mar 17, 2025 • 0 new comments - [Java] Add parsedData to Hl7v2Message and Update HL7v2IO Docs
#34213 commented on
Mar 15, 2025 • 0 new comments - Bump serialize-javascript and mocha in /sdks/typescript
#34012 commented on
Mar 13, 2025 • 0 new comments - Backlog metrics do not showing up in FlinkRunner
#25554 commented on
Mar 16, 2025 • 0 new comments - The Clean Up GCP Resources job is flaky
#31846 commented on
Mar 19, 2025 • 0 new comments - Make cloudpickle the default pickle library
#21298 commented on
Mar 18, 2025 • 0 new comments - [Task]: Improve project documentation on apache-beam pypi page.
#30145 commented on
Mar 18, 2025 • 0 new comments - [Feature Request]: Replace snappy with other `crc32c ` packages for tfrecordio
#34226 commented on
Mar 17, 2025 • 0 new comments - [Bug]: ./gradlew checkSetup fails with Java 8 on Docker dev image
#34242 commented on
Mar 16, 2025 • 0 new comments - [Bug]: The submission_environment_dependencies.txt file does not get staged when running with Flink runner on Dataproc
#32743 commented on
Mar 15, 2025 • 0 new comments - [Bug]: Update or Remove Integrations page
#27613 commented on
Mar 15, 2025 • 0 new comments - [Bug]: Javadoc still incorrectly renders non-ascii characters
#25427 commented on
Mar 14, 2025 • 0 new comments - [Feature Request]: Running Word-Count with Gradle for PowerShell
#32307 commented on
Mar 13, 2025 • 0 new comments - The PostCommit Java IO Performance Tests job is flaky
#30527 commented on
Mar 13, 2025 • 0 new comments - Beam with intellij: "Could not find implementation class 'org.apache.beam.gradle.BeamModulePlugin' for plugin 'org.apache.beam.module'"
#21582 commented on
Mar 13, 2025 • 0 new comments - [Feature Request]: {Managed IO Iceberg} - Allow users to run streaming reads
#33092 commented on
Mar 12, 2025 • 0 new comments - [Feature Request]: YAML to support creating BQT4AI
#33726 commented on
Mar 12, 2025 • 0 new comments - [YAML][Feature Request]: Integration and Usage of GCP Secret Manager with the different Apache Beam IOs
#32665 commented on
Mar 12, 2025 • 0 new comments - [Failing Test]: beam_PreCommit_Flink_Container not build Python SDK container causing test failure at PR run
#33942 commented on
Mar 12, 2025 • 0 new comments - [Bug]: SpannerIO.Write does not support named schemas
#32907 commented on
Mar 12, 2025 • 0 new comments - Bump @octokit/request-error, @actions/github and @octokit/rest in /scripts/ci/pr-bot
#33998 commented on
Mar 18, 2025 • 0 new comments - Upgrade Kafka client version to 3.7.0
#33960 commented on
Mar 18, 2025 • 0 new comments - Adding credentials option to PubsubMessageMatcher
#33958 commented on
Mar 18, 2025 • 0 new comments - SnowflakeIO: be consistent with backslash escape char
#33948 commented on
Mar 18, 2025 • 0 new comments - Enable timeout setting for Python TestPipeline (#29646)
#33866 commented on
Mar 18, 2025 • 0 new comments - Fix incorrect nullness in FlinkJobInvoker and JobInvoker
#33713 commented on
Mar 18, 2025 • 0 new comments - [Dataflow Streaming] Cache StateNamespace encoded keys using SoftReference
#33689 commented on
Mar 18, 2025 • 0 new comments - Add support for Iceberg table identifiers with special characters
#33648 commented on
Mar 18, 2025 • 0 new comments - Remove type coercion of `FinalizeWrite` in iobase.py
#33614 commented on
Mar 18, 2025 • 0 new comments - Support sharding in WriteToFiles (tested for to_csv)
#33612 commented on
Mar 18, 2025 • 0 new comments - Handle missing tables more gracefully.
#33610 commented on
Mar 17, 2025 • 0 new comments - [WIP] [Do not Merge] [Dataflow Streaming] Support for using virtual harness threads
#33543 commented on
Mar 15, 2025 • 0 new comments - Tour of Beam: update GroupByKey example
#33242 commented on
Mar 18, 2025 • 0 new comments - Read RabbitMQ messages with headers containing nested objects
#33072 commented on
Mar 15, 2025 • 0 new comments - add generics support to AutoValueUtils helpers
#32977 commented on
Mar 18, 2025 • 0 new comments - BigQueryIO uniformize direct and export reads
#32360 commented on
Mar 15, 2025 • 0 new comments - Support writing to Pubsub with ordering key; Add PubsubMessage SchemaCoder
#31608 commented on
Mar 15, 2025 • 0 new comments