Hello guys,
I get this error when trying to run an ingest pipeline from on-prem SQL to BigQuery:
"Spark program 'phase-1' failed with error: The TCP/IP connection to the host xx.xx.xx, port 1433 has failed. Error: "connect timed out. Verify the connection properties. Make sure that an instance of SQL Server is running on the host and accepting TCP/IP connections at the port. Make sure that TCP connections to the port are not blocked by a firewall.".. Please check the system logs for more details."
I ran a "connectivity test" in GCP, which reported the destination as reachable. However, when I try to ping the SQL Server from a Linux instance in the same network, it cannot be reached. I cannot reach it from Dataproc (over SSH) either.
All completed Dataproc ingest jobs show as succeeded.
What could be the issue?
I have also checked that the TCP connections to the port are not blocked by a firewall.
Here are several steps and considerations to help troubleshoot and resolve the issue:
Direct Port Connectivity Tests:
Test the port directly (for example, nc -zv xx.xx.xx 1433). Successful tests indicate open network paths and ports, while failures suggest network or firewall issues requiring attention.
SQL Server Configuration and Firewall:
Networking and Security:
Understanding Successful Dataproc Jobs:
Additional Investigation Points:
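If nc is not installed on the Dataproc node, the direct port connectivity test above can be approximated with a few lines of Python. This is only a minimal sketch: the function name check_tcp_port and the 5-second default timeout are illustrative assumptions, not part of any GCP or SQL Server tooling. Note that ping uses ICMP, which is often blocked even when TCP port 1433 is open, so a failed ping alone does not prove the port is unreachable.

```python
import socket


def check_tcp_port(host, port, timeout=5.0):
    """Try a plain TCP connection and return (ok, detail).

    Hypothetical helper for illustration: it only checks that the TCP
    handshake completes; it does not speak the SQL Server protocol.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected"
    except socket.timeout:
        # No answer at all: typically a firewall or routing problem.
        return False, "timed out"
    except ConnectionRefusedError:
        # Host reachable, but nothing is listening on that port.
        return False, "refused"
    except OSError as exc:
        # DNS failures, unreachable network, and similar errors.
        return False, f"error: {exc}"
```

Calling check_tcp_port("xx.xx.xx", 1433) from the Dataproc master node or from a VM in the same subnet distinguishes the cases: "timed out" matches the "connect timed out" in the pipeline error and points toward a firewall or routing issue, while "refused" suggests the host is reachable but SQL Server is not listening on that port.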