๊ธฐํƒ€(๊ฐœ๋ฐœ)/ํฌ๋กค๋ง(Crawling)

    [python] ์ธ์Šคํƒ€๊ทธ๋žจ ํฌ๋กค๋ง ํ•˜๊ธฐ(instagram Crawling)

    ๋จผ์ € ์ž๋ฐ”์Šคํฌ๋ฆฝํŠธ๊ฐ€ ์•„๋‹๋•Œ ํฌ๋กค๋ง ํ•˜๋Š”๋ฒ• https://10000sukk.tistory.com/3 [python]๋ฌด์‹ ์‚ฌ ํฌ๋กค๋ง ํ•˜๊ธฐ Crawling ๋จผ์ € url์„ ๋ฐ›๋Š”๋‹ค baseUrl = 'https://store.musinsa.com/app/product/search?search_type=1&q=' baseUrl1 = '&page=' plusUrl = input('๊ฒ€์ƒ‰ํ•  ์˜ท์„ ์ž…๋ ฅํ•˜์‹œ์˜ค: ') pageNum =1 url = baseUrl + quote_plus(plus.. 10000sukk.tistory.com ์ธ์Šคํƒ€๋Š” ์ž๋ฐ”์Šคํฌ๋ฆฝํŠธ ํŽ˜์ด์ง€, ์ฆ‰, ๊ทธ์—๋งž๋Š” ๋ฐฉ์‹์œผ๋กœ ํฌ๋กค๋ง ์š”๊ตฌ๋ฉ๋‹ˆ๋‹ค. ์ €๋Š” ๋ฏธํกํ•˜์ง€๋งŒ ํŽ˜์ด์ง€๋ฅผ ํฌ๋กค๋ง ํ•˜๊ธฐ์œ„ํ•ด ์ƒˆ๋กœ ๋ถˆ๋Ÿฌ์˜ค๊ณ  ํฌ๋กค๋ง ํ•˜๊ณ  -> ์ƒˆ๋กœ ๋ถˆ๋Ÿฌ์˜ค๊ณ  ํฌ๋กค๋งํ•˜๊ณ  -> ์ƒˆ๋กœ ๋ถˆ๋Ÿฌ......

    [python]๋ฌด์‹ ์‚ฌ ํฌ๋กค๋ง ํ•˜๊ธฐ Crawling

    ๋จผ์ € url์„ ๋ฐ›๋Š”๋‹ค baseUrl = 'https://store.musinsa.com/app/product/search?search_type=1&q=' baseUrl1 = '&page=' plusUrl = input('๊ฒ€์ƒ‰ํ•  ์˜ท์„ ์ž…๋ ฅํ•˜์‹œ์˜ค: ') pageNum =1 url = baseUrl + quote_plus(plusUrl) + baseUrl1 + str(pageNum) quote_plus๋Š” ํŠน์ˆ˜๋ฌธ์ž๋‚˜ ๋‹ค๋ฅธ ํ˜•์‹์˜ ๋ฌธ์ž๋ฅผ ์•„์Šคํ‚ค ์ฝ”๋“œ๋กœ ๋ณ€ํ™˜ํ•ด์ฃผ๊ณ  ๊ณต๋ฐฑ์„ '+'๋กœ ๋ณ€ํ™˜ํ•œ๋‹ค. ์ฐธ๊ณ ๋กœ, quote()๋Š” ๊ณต๋ฐฑ์„ '%20'์œผ๋กœ ๋ณ€ํ™˜ํ•œ๋‹ค. ์ด๋ฅผ ์ด์ œ from selenium import webdriver ์„ ์ด์šฉํ•ด์„œ webdriver.Chrome()์œผ๋กœ ์—ด์ˆ˜๊ฐ€ ์žˆ๋Š”๊ฒƒ์ด๋‹ค. ์ด๊ฒŒ ๋ฌด์Šจ ๋ง์ด๋ƒ๋ฉด selenium..