Tehe program is using Selenium 4.x or 3.x and connects to Chrome websocket on its own using sta/websocket-sharp and connects to regular socket using dynamic RestClient and uses fastJSON to compose and process CDP API messages. This project was mirroring the sergueik/cdp_webdriver which in turn was fork of ahajamit/chrome-devtools-webdriver-integration which interact with Chrome through websockets discovered in the selenium log. This communication did not actually require the Selenium 4.x
-
there is intentionally no
async
code to allow development code in the now defunct SharpDevelop IDE which never became able to parse the C# 5.x syntax -
for the same reason (outdated
nuget.exe
) theSelenium.WebDriver
dependency version4.x
has tobe downloaded manually:
$VERSION = '4.2.0'
. .\download_nuget_package.ps1 -version $VERSION -package_name Selenium.WebDriver
. .\download_nuget_package.ps1 -package_name 'Selenium.Support' -version $VERSION
there is also a utility to clean the packages from numerous target platform assemblies, keeping only net40
and net45
. .\prune_unneded_nuget_packages.ps1
this will unlink these
packages\Selenium.Webdriver.4.2.0\lib\net46\WebDriver.dll
packages\Selenium.Webdriver.4.2.0\lib\net47\WebDriver.dll
packages\Selenium.Webdriver.4.2.0\lib\net48\WebDriver.dll
packages\Selenium.Webdriver.4.2.0\lib\net5.0\WebDriver.dll
packages\Selenium.Webdriver.4.2.0\lib\netstandard2.0\WebDriver.dll
packages\Selenium.Webdriver.4.2.0\lib\netstandard2.1\WebDriver.dll
It reads the current sesssion Selenium logs looking for
DevTools HTTP Request: http://localhost:57161/json/version
technically there is also entry
DevTools HTTP Response: {
"Browser": "Chrome/104.0.5112.102",
"Protocol-Version": "1.3",
"User-Agent": "Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.5112.102 Safari/537.36",
"V8-Version": "10.4.132.22",
"WebKit-Version": "537.36 (@0a2e5078088fc9f9a29247aaa40af9e7ada8b79f)",
"webSocketDebuggerUrl": "ws://localhost:57161/devtools/browser/6fbfaec5-4de5-4ae2-86ae-89fcc819e704"
}
but it ignores it and reads the json from http://localhost:57161/json
:
which looks like:
[ {
"description": "",
"devtoolsFrontendUrl": "/devtools/inspector.html?ws=localhost:58045/devtools/page/D699103DDEC2772BB4C00ACD73D2BC3C",
"id": "D699103DDEC2772BB4C00ACD73D2BC3C",
"title": "localhost:58045",
"type": "page",
"url": "http://localhost:58045/json",
"webSocketDebuggerUrl": "ws://localhost:58045/devtools/page/D699103DDEC2772BB4C00ACD73D2BC3C"
}, {
"description": "",
"devtoolsFrontendUrl": "/devtools/inspector.html?ws=localhost:58045/devtools/page/E8914EBAE6858C7923841CFEA27EE8EC",
"id": "E8914EBAE6858C7923841CFEA27EE8EC",
"title": "data:,",
"type": "page",
"url": "data:,",
"webSocketDebuggerUrl": "ws://localhost:58045/devtools/page/E8914EBAE6858C7923841CFEA27EE8EC"
} ]
the entries of page
type
are used to collect the webSocketDebuggerUrl
Then it performs a WebSocket connection to that url
var ws = new WebSocket(wsurl)
and sends CDP command
ws.Send(@"{""id"":534427,""method"":""Browser.getVersion"",""params"":{}}");
and gets results, e.g.
result:
{
"id": 534427,
"result": {
"protocolVersion": "1.3",
"product": "Chrome/104.0.5112.102",
"revision": "@0a2e5078088fc9f9a29247aaa40af9e7ada8b79f",
"userAgent": "Mozilla/5.0 (Windows NT 6.3; Win64; x64) AppleWebKit/537.36 (K
HTML, like Gecko) Chrome/104.0.5112.102 Safari/537.36",
"jsVersion": "10.4.132.22"
}
}
the test sends the following message to CDP:
try {
using (var ws = new WebSocket(wsurl)) {
ws.OnMessage += (sender, e) => {
var data = e.Data;
Console.WriteLine("raw data: " + data);
Dictionary<string,object> response = JSON.ToObject<Dictionary<string,object>>(data);
var id = response["id"];
Console.WriteLine("result id: " + id);
var result = (Dictionary<string,object>)response["result"];
var userAgent = result["userAgent"];
Console.WriteLine("result userAgent: " + userAgent);
};
ws.Connect();
// NOTE: "params" is reserved
var param = new Dictionary<string,object>();
var message = new Dictionary<string,object>();
double latitude = 37.422290;
double longitude = -122.084057;
long accuracy = 100;
param["latitude"] = latitude;
param["longitude"] = longitude;
param["accuracy"] = accuracy;
message["params"] = param;
message["method"] = "Emulation.setGeolocationOverride";
message["id"] = 534428;
var payload = JSON.ToJSON(message);
Console.WriteLine(String.Format("sending: {0}", payload));
ws.Send(payload);
}
} catch (Exception ex) {
Console.WriteLine("ERROR: " + ex.ToString());
}
and clicks on "Show My Location" element:
driver.Navigate().GoToUrl("https://www.google.com/maps");
Thread.Sleep(10000);
By locator = By.CssSelector("div[jsaction*='mouseover:mylocation.main']");
IList<IWebElement> elements = driver.FindElements(locator);
Assert.IsTrue(elements.Count > 0);
elements[0].Click();
and the browser shows the googleplex neighborhood:
-
https://www.codeproject.com/Articles/618032/Using-WebSocket-in-NET-4-5-Part-2
-
https://www.codeproject.com/Articles/617611/Using-WebSocket-in-NET-4-5-Part-1
-
https://stackoverflow.com/questions/70912939/run-cdp-commands-on-selenium-c-sharp
-
https://medium.com/nerd-for-tech/your-first-c-websocket-client-5e7acc30681d
-
https://github.com/SeleniumHQ/selenium/blob/trunk/dotnet/src/webdriver/DevTools/ICommand.cs
-
https://stackoverflow.com/questions/36455533/c-sharp-selenium-access-browser-log
-
https://docs.microsoft.com/en-us/dotnet/api/system.net.websockets.clientwebsocket?view=net-6.0
-
discussion about
SeleniumExtras.WaitHelpers
nuget package -
https://stackoverflow.com/questions/65821815/how-to-use-expectedconditions-in-selenium-4