Question

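I am trying to convert a website URL (for example, www.google.com) to a PDF file, but I am not getting the expected output.

I can get the response from the URL, but I don't know how to parse it and write it to a PDF.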

I have the following utilities available:

iTextSharp DLL
Aspose utilities

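Can anyone guide me on how to convert a given URL to PDF with the above utilities in a C# console application?

I would really appreciate your help, as I have been stuck on this for the last two days...

A solution that uses OpenXML would be fine too...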

Below is the code I have tried.

    using System;
    using System.IO;
    using System.Net;
    using iTextSharp.text;
    using iTextSharp.text.html.simpleparser;
    using iTextSharp.text.pdf;

    class Program
    {
        static void Main()
        {
            string url = "http://www.economist.com/news/united-states/21647625-150-years-after-end-civil-war-states-were-once-confederate-remain";
            string strPDFpath = @"D:\MyPDF.pdf";

            try
            {
                // Download the page's HTML.
                WebRequest wReq = WebRequest.Create(url);
                string sHtml;
                using (WebResponse wRsp = wReq.GetResponse())
                using (StreamReader sRdr = new StreamReader(wRsp.GetResponseStream()))
                {
                    Console.WriteLine(((HttpWebResponse)wRsp).StatusDescription);
                    sHtml = sRdr.ReadToEnd();
                }
                Console.WriteLine(sHtml);

                // Parse the HTML into the PDF document.
                using (FileStream fs = new FileStream(strPDFpath, FileMode.Create))
                using (StringReader read = new StringReader(sHtml))
                {
                    Document targetDoc = new Document(PageSize.A2, 10f, 10f, 10f, 0f);
                    PdfWriter.GetInstance(targetDoc, fs);
                    HTMLWorker htmlparser = new HTMLWorker(targetDoc);
                    targetDoc.Open();
                    htmlparser.Parse(read); // the exception is thrown here
                    targetDoc.Close();
                }
            }
            catch (Exception e)
            {
                Console.WriteLine("{0} exception caught.", e);
            }
        }
    }

I get the error "Illegal characters in path." at the following line:

    htmlparser.Parse(read);

Convert a website URL to a PDF file
