$ utc; grab-site --input-file=in.txt; utc 2024-04-12T17:47:55.235310221Z Manhole[3167786:1712944077.8595]: Patched and . Manhole[3167786:1712944077.8657]: Manhole UDS path: /tmp/manhole-3167786 Manhole[3167786:1712944077.8657]: Waiting for new connection (in pid:3167786) ... Created lmdb db with map_size=1099511627776 Imported /z9/warc/014/z9-warc-014-in.txt-2024-04-12-1187e1fb/igsets Using these 190 ignores: %25252525 /%22%20\+[^/]+\+%20%22 /%22\+[^/]+\+%22 /%27%20\+[^/]+\+%20%27 /%27\+[^/]+\+%27 /%5C/%5C/ /'\+[^/]+\+' /(%5C)+(%22|%27) /App_Themes/.+/App_Themes/ /\\+(%22|%27) /\\+["'] /\\/\\/ /bxSlider/.+/bxSlider/ /bxSlider/bxSlider/ /clientscript/.+/clientscript/clientscript/ /clientscript/clientscript/.+/clientscript/ /clientscript/clientscript/clientscript/ /css/.+/css/css/ /css/css/.+/css/ /css/css/css/ /images/.+/images/images/ /images/images/.+/images/ /images/images/images/ /img/.+/img/img/ /img/img/.+/img/ /img/img/img/ /js/.+/js/js/ /js/js/.+/js/ /js/js/js/ /lib/exe/.*lib[-_]exe[-_]lib[-_]exe[-_] /scripts/.+/scripts/scripts/ /scripts/scripts/.+/scripts/ /scripts/scripts/scripts/ /slides/.+/slides/slides/ /slides/slides/.+/slides/ /slides/slides/slides/ /styles/.+/styles/styles/ /styles/styles/.+/styles/ /styles/styles/styles/ ^https?://((s-)?static\.ak\.fbcdn\.net|(connect\.|www\.)?facebook\.com)/connect\.php/js/.*rsrc\.php ^https?://(/.*|/)page/%d/$ ^https?://([^/]+\.)?gdcvault\.com(/.*/|/)(fonts(/.*/|/)fonts/|css(/.*/|/)css/|img(/.*/|/)img/) ^https?://([^\./]+\.)?stream\.publicradio\.org/ ^https?://([^\.]+\.)?pinterest\.com/pin/create/ ^https?://(\d|www|secure)\.gravatar\.com/avatar/ad516503a11cd5ca435acc9bb6523536 ^https?://(apis|plusone)\.google\.com/_/\+1/ ^https?://(audio\d?|nfw)\.video\.ria\.ru/ ^https?://(ssl\.|www\.)?reddit\.com/(login\?dest=|submit\?|static/button/button) ^https?://(www\.)?(megaupload|filesonic|wupload)\.com/ ^https?://(www\.)?digg\.com/submit\? ^https?://(www\.)?facebook\.com/(plugins/(share_button|like(box)?)\.php|sharer/sharer\.php|sharer?\.php|dialog/(feed|share))\? ^https?://(www\.)?facebook\.com/v[\d\.]+/plugins/like\.php ^https?://(www\.)?friendfeed\.com/share\? ^https?://(www\.)?instapaper\.com/hello2\? ^https?://(www\.)?myspace\.com/Modules/PostTo/ ^https?://(www\.)?stumbleupon\.com/(submit\?|badge/embed/) ^https?://(www\.)?technorati\.com/faves/?\?add= ^https?://(www\.)?twitter\.com/(share\?|intent/((re)?tweet|favorite)|home/?\?status=|\?status=) ^https?://(www\.)?xing\.com/(app/user\?op=share|social_plugins/share\?) ^https?://(www|draft)\.blogger\.com/(navbar\.g|post-edit\.g|delete-comment\.g|comment-iframe\.g|share-post\.g|email-post\.g|blog-this\.g|delete-backlink\.g|rearrange|blog_this\.pyra)\? ^https?://(www|px\.srvcs)\.tumblr\.com/(impixu\?|share(/link/?)?\?|reblog/) ^https?://(www|ssl)\.google-analytics\.com/(r/)?(__utm\.gif|collect\?) ^https?://.+/.+/disqus\.com/forums/$ ^https?://.+/js-agent\.newrelic\.com/nr-\d{3}(\.min)?\.js$ ^https?://.+/js/chartbeat\.js$ ^https?://.+/stats\.g\.doubleclick\.net/dc\.js$ ^https?://.+\.blogspot\.(com|in|com\.au|co\.uk|jp|co\.nz|ca|de|it|fr|se|sg|es|pt|com\.br|ar|mx|kr)/(\d{4}/\d{2}/|search/label/)(CSI/$|.*/CSI/CSI/CSI/) ^https?:///(wp-admin/|wp-login\.php\?) ^https?:///.*%5Cx26route=/archive ^https?:///.*& ^https?:///.*(\?|%5Cx26)route=(/page/:page|/archive/:year/:month|/tagged/:tag|/post/:id|/image/:post_id) ^https?:///.*amp%3Bamp%3Bamp%3B ^https?:///.+/%3Ca%20href= ^https?:///.+/jetpack-comment/\?blogid=\d+&postid=\d+ ^https?:///.+/plugins/ultimate-social-media-plus/.+/like/like/ ^https?:///.+/quote-comment-\d+/$ ^https?:///.+[\?&](replyto(com)?|like_comment)=\d+ ^https?:///.+[\?&]mode=reply ^https?:///.+[\?&]share=[a-z]{4,} ^https?:///.+\?showComment(=|%5C)\d+ ^https?:///search(/label/[^\?]+|\?q=[^&]+|)[\?&]updated-(min|max)=\d{4}-\d\d-\d\dT\d\d:\d\d:\d\d.*&max-results=\d+ ^https?://[^/]*musicproxy\.s12\.de/ ^https?://[^/]+/.+/CaptchaImage\.axd ^https?://[^/]+/anony/mjpg\.cgi$ ^https?://[^/]+/mjpg/video\.mjpg ^https?://[^/]+\.akadostream\.ru(:\d+)?/ ^https?://[^/]+\.corp\.ne1\.yahoo\.com/ ^https?://[^/]+\.facebook\.com/login\.php ^https?://[^/]+\.gaduradio\.pl/ ^https?://[^/]+\.libsyn\.com/.+/%2[02]https?:/ ^https?://[^/]+\.rastream\.com(:\d+)?/ ^https?://[^/]+\.services\.livejournal\.com/ljcounter ^https?://[^/]+\.streamtheworld\.com/ ^https?://[^/]+\.xiti\.com/hit\.xiti\? ^https?://[^\./]+\.radioscoop\.(com|net):\d+/ ^https?://[^\./]+\.streamchan\.org:\d+/ ^https?://[^\.]+\.livejournal\.com/.+/\*sup_ru/ru/UTF-8/ ^https?://[^\.]+\.livejournal\.com/.+http://[^\.]+\.livejournal\.com/ ^https?://[a-z0-9]+\.cdn\.dvmr\.fr(:\d+)?/.+\.mp3 ^https?://\d+\.media\.tumblr\.com/avatar_.+_16\.pn[gj]$ ^https?://accounts\.google\.com/(SignUp|ServiceLogin|AccountChooser|a/UniversalLogin) ^https?://add\.my\.yahoo\.com/(rss|content)\? ^https?://air\.radiorecord\.ru(:\d+)?/ ^https?://alb\.reddit\.com/ ^https?://api\.addthis\.com/ ^https?://audio\d?\.radioreference\.com/ ^https?://audiots\.scdn\.arkena\.com/ ^https?://av\.rasset\.ie/av/live/ ^https?://b\.hatena\.ne\.jp/add\? ^https?://b\.scorecardresearch\.com/ ^https?://beacon\.wikia-services\.com/ ^https?://bookmark\.naver\.com/post\? ^https?://bufferapp\.com/add\? ^https?://connect\.mail\.ru/share\? ^https?://csp\.cyworld\.com/bi/bi_recommend_pop\.php\? ^https?://del\.icio\.us/post\? ^https?://delicious\.com/(save|post)\? ^https?://download\.ted\.com/ ^https?://flattr.com/submit/auto\? ^https?://gcnplayer\.gcnlive\.com/.+ ^https?://geo\.yahoo\.com/b\? ^https?://getpocket\.com/(save|edit)/?\? ^https?://i\.dev\.cdn\.turner\.com/ ^https?://imageshack\.com/lost$ ^https?://iwiw\.hu/pages/share/share\.jsp\? ^https?://mail\.google\.com/mail/ ^https?://media\.opb\.org/clips/embed/.+\.js$ ^https?://medium\.com/_/(vote|bookmark|subscribe)/ ^https?://memori(\.qip)?\.ru/link/\? ^https?://mp3\.ffh\.de/ ^https?://mp3tslg\.tdf-cdn\.com/ ^https?://myweb2\.search\.yahoo\.com/myresults/bookmarklet\? ^https?://news\.ycombinator\.com/submitlink\? ^https?://p\.opt\.fimserve\.com/ ^https?://photobucket\.com/.+/albums/.+/albums/ ^https?://pixel\.(quantserve|wp)\.com/ ^https?://pixel\.blog\.hu/ ^https?://pixel\.redditmedia\.com/pixel/ ^https?://platform\d?\.twitter\.com/widgets/tweet_button.html\? ^https?://play(\d+)?\.radio13\.ru:8000/ ^https?://plus\.google\.com/share\? ^https?://posterous\.com/share\? ^https?://prod-preview\.wired\.com/ ^https?://pub(\d+)?\.di\.fm/ ^https?://r-a-d\.io/.+\.mp3$ ^https?://r-login\.wordpress\.com/remote-login\.php ^https?://relay\.broadcastify\.com/ ^https?://reporter\.es\.msn\.com/\?fn=contribute ^https?://s\d+\.sitemeter\.com/(js/counter\.js|meter\.asp) ^https?://service\.weibo\.com/share/share\.php\? ^https?://share\.flipboard\.com/bookmarklet/popout\? ^https?://social-plugins\.line\.me/lineit/share ^https?://sphinn\.com/index\.php\?c=post&m=submit& ^https?://static\.licdn\.com/sc/p/.+/f// ^https?://static\.licdn\.com/sc/p/com\.linkedin\.nux(:|%3A)nux-static-content(\+|%2B)[\d\.]+/f/ ^https?://stream(\d+)?\.media\.rambler\.ru/ ^https?://telegram\.me/share/url\? ^https?://tm\.uol\.com\.br/h/.+/h/ ^https?://tmz\.vo\.llnwd\.net/ ^https?://video-subtitle\.tedcdn\.com/ ^https?://vkontakte\.ru/share\.php\? ^https?://vuible\.com/pins-settings/ ^https?://web\.archive\.org/web/[^/]+/https?\:/[^/]+\.addthis\.com/.+/static/.+/static/ ^https?://wow\.ya\.ru/posts_(add|share)_link\.xml\? ^https?://www\.addthis\.com/bookmark\.php\? ^https?://www\.addtoany\.com/(add_to/|share_save\?) ^https?://www\.amazon\.com/.+/logging/log-action\.html ^https?://www\.blinklist\.com/index\.php\?Action=Blink/addblink\.php ^https?://www\.blogger\.com/feeds/\d+/\d+/comments/default/\d+ ^https?://www\.blogger\.com/feeds/\d+/posts/default/\d+ ^https?://www\.dreamwidth\.org/tools/(memadd|tellafriend)\? ^https?://www\.flickr\.com/(explore/|photos/[^/]+/(sets/\d+/(page\d+/)?)?)\d+_[a-f0-9]+(_[a-z])?\.jpg$ ^https?://www\.flickr\.com/change_language\.gne ^https?://www\.google\.com/(reader/link\?|buzz/post\?) ^https?://www\.google\.com/accounts/AccountChooser ^https?://www\.google\.com/bookmarks/mark\? ^https?://www\.google\.com/recaptcha/(api|mailhide/d\?) ^https?://www\.infomous\.com/cloud_widget/lib/lib/ ^https?://www\.khaleejtimes\.com/.+/images/.+/images/ ^https?://www\.khaleejtimes\.com/.+/imgactv/.+/imgactv/ ^https?://www\.khaleejtimes\.com/.+/kt_.+/kt_ ^https?://www\.linkedin\.com/(cws/share|shareArticle)\? ^https?://www\.livejournal\.com/(tools/memadd|update|(identity/)?login)\.bml\? ^https?://www\.netvibes\.com/subscribe\.php\? ^https?://www\.newsvine\.com/_wine/save\? ^https?://www\.odnoklassniki\.ru/dk\?st\.cmd=addShare ^https?://www\.warnerbros\.com/\d+$ ^https?://www\.youtube\.com/.*\[\[.+\]\] ^https?://www\.youtube\.com/.*\{\{.+\}\} ^https?://zakladki\.yandex\.ru/newlink\.xml\? Imported /z9/warc/014/z9-warc-014-in.txt-2024-04-12-1187e1fb/ignores Using these 190 ignores: %25252525 /%22%20\+[^/]+\+%20%22 /%22\+[^/]+\+%22 /%27%20\+[^/]+\+%20%27 /%27\+[^/]+\+%27 /%5C/%5C/ /'\+[^/]+\+' /(%5C)+(%22|%27) /App_Themes/.+/App_Themes/ /\\+(%22|%27) /\\+["'] /\\/\\/ /bxSlider/.+/bxSlider/ /bxSlider/bxSlider/ /clientscript/.+/clientscript/clientscript/ /clientscript/clientscript/.+/clientscript/ /clientscript/clientscript/clientscript/ /css/.+/css/css/ /css/css/.+/css/ /css/css/css/ /images/.+/images/images/ /images/images/.+/images/ /images/images/images/ /img/.+/img/img/ /img/img/.+/img/ /img/img/img/ /js/.+/js/js/ /js/js/.+/js/ /js/js/js/ /lib/exe/.*lib[-_]exe[-_]lib[-_]exe[-_] /scripts/.+/scripts/scripts/ /scripts/scripts/.+/scripts/ /scripts/scripts/scripts/ /slides/.+/slides/slides/ /slides/slides/.+/slides/ /slides/slides/slides/ /styles/.+/styles/styles/ /styles/styles/.+/styles/ /styles/styles/styles/ ^https?://((s-)?static\.ak\.fbcdn\.net|(connect\.|www\.)?facebook\.com)/connect\.php/js/.*rsrc\.php ^https?://(/.*|/)page/%d/$ ^https?://([^/]+\.)?gdcvault\.com(/.*/|/)(fonts(/.*/|/)fonts/|css(/.*/|/)css/|img(/.*/|/)img/) ^https?://([^\./]+\.)?stream\.publicradio\.org/ ^https?://([^\.]+\.)?pinterest\.com/pin/create/ ^https?://(\d|www|secure)\.gravatar\.com/avatar/ad516503a11cd5ca435acc9bb6523536 ^https?://(apis|plusone)\.google\.com/_/\+1/ ^https?://(audio\d?|nfw)\.video\.ria\.ru/ ^https?://(ssl\.|www\.)?reddit\.com/(login\?dest=|submit\?|static/button/button) ^https?://(www\.)?(megaupload|filesonic|wupload)\.com/ ^https?://(www\.)?digg\.com/submit\? ^https?://(www\.)?facebook\.com/(plugins/(share_button|like(box)?)\.php|sharer/sharer\.php|sharer?\.php|dialog/(feed|share))\? ^https?://(www\.)?facebook\.com/v[\d\.]+/plugins/like\.php ^https?://(www\.)?friendfeed\.com/share\? ^https?://(www\.)?instapaper\.com/hello2\? ^https?://(www\.)?myspace\.com/Modules/PostTo/ ^https?://(www\.)?stumbleupon\.com/(submit\?|badge/embed/) ^https?://(www\.)?technorati\.com/faves/?\?add= ^https?://(www\.)?twitter\.com/(share\?|intent/((re)?tweet|favorite)|home/?\?status=|\?status=) ^https?://(www\.)?xing\.com/(app/user\?op=share|social_plugins/share\?) ^https?://(www|draft)\.blogger\.com/(navbar\.g|post-edit\.g|delete-comment\.g|comment-iframe\.g|share-post\.g|email-post\.g|blog-this\.g|delete-backlink\.g|rearrange|blog_this\.pyra)\? ^https?://(www|px\.srvcs)\.tumblr\.com/(impixu\?|share(/link/?)?\?|reblog/) ^https?://(www|ssl)\.google-analytics\.com/(r/)?(__utm\.gif|collect\?) ^https?://.+/.+/disqus\.com/forums/$ ^https?://.+/js-agent\.newrelic\.com/nr-\d{3}(\.min)?\.js$ ^https?://.+/js/chartbeat\.js$ ^https?://.+/stats\.g\.doubleclick\.net/dc\.js$ ^https?://.+\.blogspot\.(com|in|com\.au|co\.uk|jp|co\.nz|ca|de|it|fr|se|sg|es|pt|com\.br|ar|mx|kr)/(\d{4}/\d{2}/|search/label/)(CSI/$|.*/CSI/CSI/CSI/) ^https?:///(wp-admin/|wp-login\.php\?) ^https?:///.*%5Cx26route=/archive ^https?:///.*& ^https?:///.*(\?|%5Cx26)route=(/page/:page|/archive/:year/:month|/tagged/:tag|/post/:id|/image/:post_id) ^https?:///.*amp%3Bamp%3Bamp%3B ^https?:///.+/%3Ca%20href= ^https?:///.+/jetpack-comment/\?blogid=\d+&postid=\d+ ^https?:///.+/plugins/ultimate-social-media-plus/.+/like/like/ ^https?:///.+/quote-comment-\d+/$ ^https?:///.+[\?&](replyto(com)?|like_comment)=\d+ ^https?:///.+[\?&]mode=reply ^https?:///.+[\?&]share=[a-z]{4,} ^https?:///.+\?showComment(=|%5C)\d+ ^https?:///search(/label/[^\?]+|\?q=[^&]+|)[\?&]updated-(min|max)=\d{4}-\d\d-\d\dT\d\d:\d\d:\d\d.*&max-results=\d+ ^https?://[^/]*musicproxy\.s12\.de/ ^https?://[^/]+/.+/CaptchaImage\.axd ^https?://[^/]+/anony/mjpg\.cgi$ ^https?://[^/]+/mjpg/video\.mjpg ^https?://[^/]+\.akadostream\.ru(:\d+)?/ ^https?://[^/]+\.corp\.ne1\.yahoo\.com/ ^https?://[^/]+\.facebook\.com/login\.php ^https?://[^/]+\.gaduradio\.pl/ ^https?://[^/]+\.libsyn\.com/.+/%2[02]https?:/ ^https?://[^/]+\.rastream\.com(:\d+)?/ ^https?://[^/]+\.services\.livejournal\.com/ljcounter ^https?://[^/]+\.streamtheworld\.com/ ^https?://[^/]+\.xiti\.com/hit\.xiti\? ^https?://[^\./]+\.radioscoop\.(com|net):\d+/ ^https?://[^\./]+\.streamchan\.org:\d+/ ^https?://[^\.]+\.livejournal\.com/.+/\*sup_ru/ru/UTF-8/ ^https?://[^\.]+\.livejournal\.com/.+http://[^\.]+\.livejournal\.com/ ^https?://[a-z0-9]+\.cdn\.dvmr\.fr(:\d+)?/.+\.mp3 ^https?://\d+\.media\.tumblr\.com/avatar_.+_16\.pn[gj]$ ^https?://accounts\.google\.com/(SignUp|ServiceLogin|AccountChooser|a/UniversalLogin) ^https?://add\.my\.yahoo\.com/(rss|content)\? ^https?://air\.radiorecord\.ru(:\d+)?/ ^https?://alb\.reddit\.com/ ^https?://api\.addthis\.com/ ^https?://audio\d?\.radioreference\.com/ ^https?://audiots\.scdn\.arkena\.com/ ^https?://av\.rasset\.ie/av/live/ ^https?://b\.hatena\.ne\.jp/add\? ^https?://b\.scorecardresearch\.com/ ^https?://beacon\.wikia-services\.com/ ^https?://bookmark\.naver\.com/post\? ^https?://bufferapp\.com/add\? ^https?://connect\.mail\.ru/share\? ^https?://csp\.cyworld\.com/bi/bi_recommend_pop\.php\? ^https?://del\.icio\.us/post\? ^https?://delicious\.com/(save|post)\? ^https?://download\.ted\.com/ ^https?://flattr.com/submit/auto\? ^https?://gcnplayer\.gcnlive\.com/.+ ^https?://geo\.yahoo\.com/b\? ^https?://getpocket\.com/(save|edit)/?\? ^https?://i\.dev\.cdn\.turner\.com/ ^https?://imageshack\.com/lost$ ^https?://iwiw\.hu/pages/share/share\.jsp\? ^https?://mail\.google\.com/mail/ ^https?://media\.opb\.org/clips/embed/.+\.js$ ^https?://medium\.com/_/(vote|bookmark|subscribe)/ ^https?://memori(\.qip)?\.ru/link/\? ^https?://mp3\.ffh\.de/ ^https?://mp3tslg\.tdf-cdn\.com/ ^https?://myweb2\.search\.yahoo\.com/myresults/bookmarklet\? ^https?://news\.ycombinator\.com/submitlink\? ^https?://p\.opt\.fimserve\.com/ ^https?://photobucket\.com/.+/albums/.+/albums/ ^https?://pixel\.(quantserve|wp)\.com/ ^https?://pixel\.blog\.hu/ ^https?://pixel\.redditmedia\.com/pixel/ ^https?://platform\d?\.twitter\.com/widgets/tweet_button.html\? ^https?://play(\d+)?\.radio13\.ru:8000/ ^https?://plus\.google\.com/share\? ^https?://posterous\.com/share\? ^https?://prod-preview\.wired\.com/ ^https?://pub(\d+)?\.di\.fm/ ^https?://r-a-d\.io/.+\.mp3$ ^https?://r-login\.wordpress\.com/remote-login\.php ^https?://relay\.broadcastify\.com/ ^https?://reporter\.es\.msn\.com/\?fn=contribute ^https?://s\d+\.sitemeter\.com/(js/counter\.js|meter\.asp) ^https?://service\.weibo\.com/share/share\.php\? ^https?://share\.flipboard\.com/bookmarklet/popout\? ^https?://social-plugins\.line\.me/lineit/share ^https?://sphinn\.com/index\.php\?c=post&m=submit& ^https?://static\.licdn\.com/sc/p/.+/f// ^https?://static\.licdn\.com/sc/p/com\.linkedin\.nux(:|%3A)nux-static-content(\+|%2B)[\d\.]+/f/ ^https?://stream(\d+)?\.media\.rambler\.ru/ ^https?://telegram\.me/share/url\? ^https?://tm\.uol\.com\.br/h/.+/h/ ^https?://tmz\.vo\.llnwd\.net/ ^https?://video-subtitle\.tedcdn\.com/ ^https?://vkontakte\.ru/share\.php\? ^https?://vuible\.com/pins-settings/ ^https?://web\.archive\.org/web/[^/]+/https?\:/[^/]+\.addthis\.com/.+/static/.+/static/ ^https?://wow\.ya\.ru/posts_(add|share)_link\.xml\? ^https?://www\.addthis\.com/bookmark\.php\? ^https?://www\.addtoany\.com/(add_to/|share_save\?) ^https?://www\.amazon\.com/.+/logging/log-action\.html ^https?://www\.blinklist\.com/index\.php\?Action=Blink/addblink\.php ^https?://www\.blogger\.com/feeds/\d+/\d+/comments/default/\d+ ^https?://www\.blogger\.com/feeds/\d+/posts/default/\d+ ^https?://www\.dreamwidth\.org/tools/(memadd|tellafriend)\? ^https?://www\.flickr\.com/(explore/|photos/[^/]+/(sets/\d+/(page\d+/)?)?)\d+_[a-f0-9]+(_[a-z])?\.jpg$ ^https?://www\.flickr\.com/change_language\.gne ^https?://www\.google\.com/(reader/link\?|buzz/post\?) ^https?://www\.google\.com/accounts/AccountChooser ^https?://www\.google\.com/bookmarks/mark\? ^https?://www\.google\.com/recaptcha/(api|mailhide/d\?) ^https?://www\.infomous\.com/cloud_widget/lib/lib/ ^https?://www\.khaleejtimes\.com/.+/images/.+/images/ ^https?://www\.khaleejtimes\.com/.+/imgactv/.+/imgactv/ ^https?://www\.khaleejtimes\.com/.+/kt_.+/kt_ ^https?://www\.linkedin\.com/(cws/share|shareArticle)\? ^https?://www\.livejournal\.com/(tools/memadd|update|(identity/)?login)\.bml\? ^https?://www\.netvibes\.com/subscribe\.php\? ^https?://www\.newsvine\.com/_wine/save\? ^https?://www\.odnoklassniki\.ru/dk\?st\.cmd=addShare ^https?://www\.warnerbros\.com/\d+$ ^https?://www\.youtube\.com/.*\[\[.+\]\] ^https?://www\.youtube\.com/.*\{\{.+\}\} ^https?://zakladki\.yandex\.ru/newlink\.xml\? Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Imported /z9/warc/014/z9-warc-014-in.txt-2024-04-12-1187e1fb/max_content_length https://files.catbox.moe/1e8fxt.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Imported /z9/warc/014/z9-warc-014-in.txt-2024-04-12-1187e1fb/delay Imported /z9/warc/014/z9-warc-014-in.txt-2024-04-12-1187e1fb/concurrency /home/ubuntu/gs-venv/lib/python3.8/site-packages/wpull/protocol/http/client.py:185: UserWarning: HTTP session did not complete. warnings.warn(_('HTTP session did not complete.')) Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/1zlkq7.png ... https://files.catbox.moe/3rqahe.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/4odcd3.png ... https://files.catbox.moe/4shok7.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/6j73hr.png ... https://files.catbox.moe/75op29.png ... https://files.catbox.moe/bgcloq.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/boks3t.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/cz3b31.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/diqs61.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/em9y06.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/fry59i.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/gj8ss7.png ... https://files.catbox.moe/gym7km.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/k7yg71.png ... https://files.catbox.moe/me9ooo.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/ndr52m.png ... https://files.catbox.moe/nnoa2p.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/o5kc9g.png ... https://files.catbox.moe/q0ler2.png ... https://files.catbox.moe/q4w1f7.png ... https://files.catbox.moe/rqcap4.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/rwk10a.png ... https://files.catbox.moe/rxcgbb.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/s0rdun.png ... https://files.catbox.moe/sxqm0w.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/t0kwc9.png ... https://files.catbox.moe/takdt3.png ... https://files.catbox.moe/wts4q0.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/wyip7e.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/xmky3f.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/z7bmbj.png ... https://files.catbox.moe/zupxeg.png ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") https://files.catbox.moe/robots.txt ... https://files.catbox.moe/sitemap.xml ... https://files.catbox.moe/robots.txt ... https://files.catbox.moe/sitemap.xml ... https://files.catbox.moe/robots.txt ... https://files.catbox.moe/sitemap.xml ... Disconnected from ws:// server: ConnectionRefusedError(111, "Connect call failed ('127.0.0.1', 29000)") Finished grab 1187e1fb68f4a01c106de0caef23febe file:///z9/warc/014/in.txt with exit code 8 Output is in directory: /z9/warc/014/z9-warc-014-in.txt-2024-04-12-1187e1fb Task was destroyed but it is pending! task: wait_for=()]>> 2024-04-12T17:50:54.729795443Z $